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Abstract. One major open conjecture in the area of critical random graphs, formulated hy 
statistical physicists, and supported by a large amount of numerical evidence over the last 
decade ^3 


^ is as follows: for a wide array of random graph models with degree 
exponent t e (3,4), distances between typical points both within maximal components in 
the critical regime as well as on the minimal spanning tree on the giant component in the 
supercritical regime scale like 

In this paper we study the metric space structure of maximal components of the mul¬ 
tiplicative coalescent, in the regime where the sizes converge to excursions of Levy pro¬ 
cesses “without replacement” , yielding a completely new class of limiting random metric 
spaces. A by-product of the analysis yields the continuum scaling limit of one fundamen¬ 
tal class of random graph models with degree exponent t e (3,4) where edges are rescaled 
by yielding the first rigorous proof of the above conjecture. The limits in this 

case are compact “tree-like” random fractals with a dense collection of hubs (infinite degree 
vertices), a finite number of which are identified with leaves to form shortcuts. In a special 
case, we show that the Minkowski dimension of the limiting spaces equal (t - 2)/(t - 3) a.s., 
in stark contrast to the Erdos-Renyi scaling limit whose Minkowski dimension is 2 a.s. It is 
generally believed that dynamic versions of a number of fundamental random graph models, 
as one moves from the barely subcritical to the critical regime can be approximated by the 
multiplicative coalescent. In work in progress, the general theory developed in this paper is 
used to prove analogous limit results for other random graph models with degree exponent 
TE (3,4). 

Our proof makes crucial use of inhomogeneous continuum random trees (ICRT), which 
have previously arisen in the study of the entrance boundary of the additive coalescent. We 
show that tilted versions of the same objects using the associated mass measure, describe 
connectivity properties of the multiplicative coalescent. Since convergence of height pro¬ 
cesses of corresponding approximating p-trees is not known, we use general methodology in 
[Til and develop novel techniques relying on first showing convergence in the Gromov-weak 
topology and then extending this to Gromov-Hausdorff-Prokhorov convergence by proving a 
global lower mass-bound. 
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1. Introduction and results 


In the last two decades many results regarding scaling limits of large discrete random ob¬ 
jects to continuum analogs have been proved. Examples range from Aldous’s continuum 
random tree Schramm-Loewner evolution and critical planar systems 1^, to what 


is most closely related to this paper: scaling limits of maximal components in the critical 
regime for random graphs as well as the minimal spanning tree on the giant component in 
the supercritical regime |3}|^ . 

Motivated by empirical observations on real-world networks, in the last decade, re¬ 
searchers from a wide array of fields including computer science, the social sciences and 
statistical physics have proposed a large number of random graph models to explain various 
functionals of real world systems including power law degree distributions and small world 
scaling of distances between nodes in the network |[^|^|^|^|^|^[^[^ . Many of these 
models have a parameter t related to the edge density and a model-dependent critical point 
tc- Writing n for the number of vertices in the network, lit <tc then the maximal connected 
component ^i(n) has size that is negligible compared to n, while if t> tc one has a giant 
component ^i(n) ~ f{t)n for some positive model-dependent function f{t) > 0 for t > tc- 
The “t - tc” regime is often referred to as the critical regime. Just as a study of the classical 
critical Erdos-Renyi random graph spurred enormous activity in probabilistic combinatorics 
in the 90s , the study of the critical regime of these new random graph models 

and new phenomena such as explosive percolation ||^|^ have motivated a concerted effort 
to understand the critical regime of these new random graph models. 


In this context, for more than a decade (^|^|^|^, one of the fundamental open con¬ 
jectures in this area (loosely stated) is as follows. Consider distances between typical points 
in the maximal component either in the critical regime or the minimal spanning tree on the 
giant component in the supercritical regime scale 


(a) If the random graph model has an asymptotic degree distribution with finite third mo¬ 
ments, then distances scale like 

(b) Ifthe random graph model has a limiting degree distribution with tail pk ~ Clk^ 

for T e (3,4), then distances scale like ^C“3 )/(t-i)_ 


Contributions of this paper: Since we will need to setup some notation before getting to the 
main results, let us give a general overview of the contributions of this paper: 

(i) General theory: The fundamental aim of the paper is to develop a general theory one 
can use to prove (b) in the conjecture above for a wide class of random graphs and, 
in particular, derive a new class of continuum scaling limits. To do so, we consider 
the multiplicative coalescent with entrance boundary in the space Iq as in (see 
(ns below). Viewing the maximal components as measured metric spaces (using 
graph distance and vertex weights), we show that these components with edges and 
associated measures properly rescaled converge to continuum random objects in the 
Gromov weak sense. These resulting objects are obtained via appropriate tilts and ver¬ 
tex identifications of inhomogeneous continuum random trees; untiltedvexsions of the 
same objects have been used to describe the entrance boundary of the additive coales¬ 
cent These resulting random objects are “tree-like” but with a dense collection of 
“hubs” (corresponding to infinite-degree vertices). 
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(ii) Proof techniques: The standard technique in proving such results is to study height 
processes of certain spanning trees of the components and to show that these processes 
converge to limiting excursions that code the limiting random real trees. In our context, 
the convergence of height processes of the corresponding approximating p-trees is not 
known. In (^, the height processes of p-trees were shown to converge to limiting ex¬ 
cursions in certain regimes, but these results are not applicable to our situation. 

Because of this, we develop new techniques relying on first showing convergence in 
Gromov-weak topology via a careful analysis of the tree spanning a finite collection of 
“typical” points in random “tilted" p-trees. In one fundamental class of random graph 
models, we then extend Gromov-weak convergence to Gromov-Hausdorff-Prokhorov 
convergence by proving a global lower mass-bound. 

(iii) Special case: As an example of the general theory, we study the special case of the 
Norros-Reittu model (which in the regime of interest has been proven to be 
equivalent to the Chung-Lu model and the rank-one random graph 1^). In this 


case, we show that the limiting spaces are compact. We also show that the box-counting 
or Minkowski dimension equals (t - 2) / (t - 3) a.s. 


In work in progress (^, we use the general theory in this paper to analyze another funda¬ 
mental random graph model, the configuration model with degree distribution with expo¬ 
nent T G (3,4), and derive the continuum analogs of the maximal components of this model. 
We defer a more detailed discussion of related work and the relevance of the current study 
to SectionIHl 


Organization of the paper: A reasonable amount of notation regarding the entrance bound¬ 
ary of the multiplicative coalescent is required to describe the main results (Theorems |1.8[ 
[L^ . To ease the reader into the paper, we start in Section [lT] with the special case of 
the Norros-Reittu model and in Theorem |1.2| describe what the main results imply for this 
model. Then in Section [L2] we define the multiplicative coalescent as well as the class of en¬ 
trance boundaries of importance for the paper and then describe the two main results. The 
results use two notions of convergence of metric spaces; these are given a precise formula¬ 
tion in Section f 


Section |2^ describes an important class of random trees called p-trees 
and the corresponding inhomogenous continuum random trees that arise as scaling limits 
of these objects. These are then used in Section|2^to give a precise description of the scaling 
limits of maximal components. We discuss the relevance of the main results, relate these to 
existing work and give an overview of the proof in Section]^ The proofs of the main results 
are contained in Sections |4]-[3 


Notation: Throughout this paper, we make use of the following standard notation. We let 

d P 

—► denote convergence in distribution, and —► convergence in probability. For a sequence 

p 

of random variables (X„)„^i, we write = Op(fi„) when \X„\/bn —^ 0 as n ^ oo. For a non¬ 
negative function n g(n), we write fin) - 0(g(n)) when |/(n) |/g(n) is uniformly bounded, 
and fin) - oigin)) when limn^oofin)/gin) - 0. Furthermore, we write fin) - 0(g(n)) if 
fin) - Oigin)) and gin) - Oifin)). We say that a sequence of events iSn)n^i occurs with 
high probability (whp) when ViSfi 1- 


1.1. Rank- one random graph. 
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1.1.1. Model formulation. We start by describing a particular class of random graph models 

sometimes also re¬ 


called the Poissonian random graph or the Norros-Reittu model 
ferred to as the rank-one random graph model 


In the regime of interest for this paper, 


as shown in this model is equivalent to the Chung-Lu model |29| - (3^ and the Britton- 
Deijfen-Martin-Ldf model ||^. Start with vertex set [n] {1,2. n} and suppose each ver¬ 

tex i e [n] has a weight Wi ^ 0 attached to it; intuitively this measures the propensity or 

attractiveness of this vertex in the formation of links. Writing w = {wi . Wn), place an edge 

between i and j independently for each if j ^[n] with probability 


qij = qijiw) := l-expi-WiWj/£n), ( 1 - 1 ) 

where is the total weight given by 

£n ■— ^ ^i■ 

ieln] 

To complete the formulation, we need to specify how these vertex weights are chosen. Es¬ 
sentially we want the empirical distribution of weights n~^Y.iE[n]S {Wi} to converge to a fixed 
pre-specified distribution F as n ^ oo. There are a number of ways to do this, but for this 
paper the following choice turns out to be convenient for a clear statement of the results. Let 
iuq)ie[n] be constructed by 

Wi:^[l-Fr^ii/n), ie[n], (1.2) 

where F is a cumulative distribution function on [0,oo) and [1 - F]“^ is the generalized in¬ 
verse 

[l-F]“HM):=inf{s: [1-F](s) ^ u}. 

We assume there exists r e (3,4) and Cp > 0 such that 

lim x^”^[l-F(x)] := Cp. (1.3) 

X—►OO 

We will use W for a random variable with distribution F. We will use NR„(m/) to denote the 
corresponding random graph. 


1.1.2. Motivation and known results. As described in the introduction, one impetus for the 
formulation of a wide array of network models, is to capture the heterogeneous and heavy¬ 
tailed nature of the degree distribution of empirical networks. Write for the number of 
vertices with degree k in NR„(u/). Under the assumptions in the previous section, one can 
show (2^ Theorem 3.13] that 


Nk p 

— -Ue 
n 


-w 


Id 


k^O, 


(1.4) 


where W ~ F. In particular, the degree distribution also has tail exponent t. More important 
in the context of this paper is the connectivity threshold. For i ^ 1 write for the ith largest 
connected component and let \% \ denote its number of vertices. Now define the parameter 


E(W2) 
E(W) ’ 


(1.5) 


and note that v < oo by l |1.3i l. Then by Theorem 3.1 and Section 16.4], we have the 
following criterion for the phase transition for the largest component: 
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p 

(a) Supercritical regime: If v > 1, then there exists p e (0,1) such that —► p whilst 

\^2\ln^0] 

(h) Subcritical regime: If v < 1, then |S^i|/n ^ 0. 

p 

The main aim of this paper is to understand the critical regime v = 1 where also \'^i\ln —► 
0. In this setting, there are different universality classes depending on the vertex weights. 
In the Erdos-Renyi or weakly inhomogeneous universality class, critical clusters have size of 
order and their metric space structure was discovered hy Addario-Berry Broutin and 
Goldschmidt |j^. Interestingly, when E(VE^) < oo, component sizes still scale like 
while assuming finite 6 + £-moments the metric space structure of rank-1 inhomogeneous 
random graphs is (apart from a trivial rescaling of size and time) the same . However, in 
the strongly inhomogeneous regime where E(VE^) = oo, the scaling limits of critical clusters 
are dramatically different in the sense that their sizes are gives hy where t is the 

degree power-law exponent given hy ( |1.3t (^|^. In this paper, we focus on their metric 
space structure, obtained after rescaling edges hy taking the limit as n ^ oo. 

We show that this limiting metric space is compact and its Minkowski dimension equals 
(t - 2) / (t - 3), whereas the Erdos-Renyi scaling limit has Minkowski dimension 2. 

In this paper, we analyze the entire critical scaling window. Let w denote the weight se¬ 
quence as in and fix A £ IS. Now consider the weight sequence wiX) (i£;(A))(e[n] 
defined by 


16 


m/(A) := 


1-t 


A 


n 


(t-3)/(t-1) 


W. 


Write NR„(m/(A)) for the corresponding random graph and let ^/(A) denote the corre¬ 
sponding jth largest component. Then this critical scaling window was first identified and 
studied in where it was shown that for every fixed A e IR, as well as 

^(t- 2 )/(t-i)/|c^^I The entire distributional asymptotics of component sizes were 

derived in where it was shown that in the product topology on 


\%a)\ 

^„(t-2)/(t-I) 



(ZdA): i^l). 


where (Z, (A ): i^l) are excursions away from zero of a special stochastic process described 
in more detail in Section fL^ 


1.1.3. Our results. We make the following convention: 

Eor any metric measure space (S, d, p) and a > 0, aS denotes the metric mea¬ 
sure space (S, ad,p), i.e, the space where the distance is scaled by a and the 
measure remains unchanged. 

Consider the random graph NR„(u^(A)) and view each connected component ^ as a con¬ 
nected metric space via the usual graph distance where each edge has length one. Eurther, 
we can view each connected component as a metric measure space by assigning weight 
ujj) to vertex i e Note that the normalization yields a probability measure on 
each connected component. Let denote the space of (equivalence classes) of compact 
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measured metric spaces equipped with the Gromov-Hausdorff-Prokhorov metric (see Sec- 

M”(A):=(^dA): J>1) (1.6) 


tion|2.1.l1for definition). View 


as a random element of 

Next recall that the lower and upper hox counting dimensions of a compact metric space 
are given hy 


dim(.y^) := liminf 

- 5|0 


log(l/d) 


and 


dim(.^) := limsup 
510 


log [^(^,6)] 
log(l/d) 


respectively, where is the minimal number of open halls with radius 5 required 

to cover Also let dim;j(.^) denote the Hausdorff dimension of When dim (.y^) = 
dim(.y^) = dim, then the hox-counting or Minkowski dimension is dim. 

Before stating our main result, we introduce a technical condition. 


Assumption 1.1. The support of the limiting distribution F (defined just before isgiven 
by [t,oo) for some i > 0. Further, F has a continuous density f on [i,cx3) such that xf{x) is 
non-increasing on [t,oo). 

Note that distributions F that are exact power laws, i.e., of the form F(x) = 1 - {ilxY~^ for 
X > L and some t e (3,4), satisfy Assumption ! 1.1[ The main result of this section is as follows: 

Theorem 1.2 (Scaling limits with degree exponent t g (3,4)). Fix A g IR and consider the crit¬ 
ical Norros-Reittu model NR„(m/(A)), i.e, assume thatv = 1 where v is as in Assume that 
the limiting distribution F satisfies Assumption \\.\\ 

Then, there exists an appropriate limiting sequence of random compact metric measure 
spacesM^iA) := (M™(A)),^i such that the components in the critical regime satisfy 

^(t- 3 !/(t-i) M"''(A) ^ M”(A), asn^oo. (1.7) 

Here convergence is with respect to the product topology on induced by the Gromov- 
Hausdorff-Prokhorov metric on each coordinate y. For each i ^ 1, the limiting metric spaces 
have the following properties: 

(a) M“(A) is random compact metric measure space obtained by taking a random real tree 
tJi (A) and identifying a random (finite) number of pairs of points (thus creating shortcuts). 

(b) Call a point ueSTi (A) a hub point if deleting the u results in infinitely many disconnected 
components ofyfiX). Then iTilX) has infinitely many hub points which are everywhere 
dense on the tree STi (A). 

(c) The box-counting or Minkowski dimension ofMH (A) satisfies 

dim(MH(A)) = - —^ a.s. (1.8) 

Consequently, the Hausdorff dimension satisfies the bound dimhiM™ [X)) ^ (t-2)/(t-3) 
a.s. 


Conjecture 1.3. We strongly believe that both the Hausdorff dimension and the packing di¬ 
mension of MHiX) equalH-3) a.s. See Section^for a discussion. 
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1.2. Connectivity asymptotics for the multiplicative coalescent. In this section we con¬ 
sider a slightly more general setting than in Section [Llj The motivation is as follows: re¬ 
call that for the rank-one model, two vertices were connected with essentially probability 
proportional to the product of the weight between these two vertices. For probabilists, this 
connectivity pattern is quite reminiscent of the famous multiplicative coalescent ||^ [To)[T^ . 
Whilst interesting in its own right, its fundamental importance in the context of random 
graphs is as follows: A wide array of random graph models can be constructed in a dynamic 
fashion where as time progresses new edges are created between pre-existing clusters. Even 
though the merging dynamics between connected components tend to be quite different 
from that specified by the multiplicative coalescent, the mergers from the barely subcritical 
regime through the critical scaling window can be approximated by the multiplicative coa¬ 
lescent. This idea was exploited in |jT^ to prove universality of scaling limits in the critical 
regime for several random graphs models. 

Thus components at criticality of a wide array of random graph models can be thought of 
consisting of two major parts: 


(a) “Blobs" that are components formed in the barely subcritical regime. 

(b) Edges formed between such blobs as the system proceeds from the barely subcritical 
regime through the critical scaling window. 

The results below (in particular Theorem |1.8[ specify how to handle the second aspect. In 
a companion paper we show how one can use macroscopic averaging of distances within 
blobs in random graph models such as the configuration model to show that these models 
also have the same scaling limit in the critical regime as Theorem |1.2| in the setting where 
degrees obey power-laws with exponents r g (3,4). Eurther, it will follow from Theorem |1.8| 
that the convergence in l |1.7| l holds with respect to the product topology induced by Gromov- 
weak topology on each coordinate. Therefore, Theorem |1.2| can be recovered partially from 
the more general Theorem |1.8| at the expense of working with a weaker topology. 

Before stating the result we will need to define the multiplicative coalescent. The natural 
domain of this Markov process is the space 



X = (xi, X2,...): Xi ^ X2 ^ ^ 0, 


<°o}’ 


(1.9) 


equipped with the metric <i(x,y) := - yt)^. We will work in the simpler setup where 

the Markov process starts with a finite number of clusters, i.e, the process starts with x g f ^ 
such that 3n <oo such that Xi - 0 for i > n. Write (n) for the collection of such vectors. 
Now the Markov process (X(t))f^o with initial state X(0) = x evolves as follows. Write X(t) = 
iXiit))i^i. Then for i j, clusters i and j merge at rate Xi{t) ■ Xj{t) into a single cluster of 
size Xi{t) + Xj{t). 

Note that for any fixed time f > 0, it is easy to find the distribution of masses X(f) via the 
following random graph: 


Definition 1.4 (Random graph ^^^(x, t)). Consider the vertex set [n] := {1,2. n} and assign 

weight Xi to vertex i. Now connect each pair of vertices i,j with i f j independently with 
probability 


qij := \-exp[-tXiXj). 


(1.10) 
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Call this random graph t). For a connected component c t), let mass(^) := 
Xi. Let {'^i{t)) denote the connected components arranged in decreasing order of their 
masses. 


The following is obvious from the definition of the multiplicative coalescent: 


Lemma 1.5. For each fixed f^O, the masses of the multiplicative coalescent at time t started 
with finite number of initial clusters with masses x satisfies 

iXiit): i^l) = (mass(S^/(f)): i ^ l). 


Analogous to ( |1.9t , consider the two spaces 

:= |c:= (ci,C 2 ,...) : Cl ^ C 2 ^ 0, ^ cf <oo|, (1-11) 


These spaces turn out to be crucial in describing the entrance boundary of the eternal multi¬ 
plicative coalescent in . In the context of this paper, we are interested in studying scaling 
limits of connected components of the random graph t) when (suitably normalized) 
asymptotics of the weight vector x are described by a vector celo. Let 

(T,(x):=^x[, l^r^3. 

i 

We will make the following assumptions about the weight vector x : = x(n) used to form the 
graph t). These place the associated graph in a particular entrance boundary of the 
associated eternal multiplicative coalescent Proposition 7]. 


Assumption 1.6. For each n^ letx'^"^ - (x'"’ : I ^ i ^ n) be an initial finite-length vector 
belonging to L^(n). Suppose that as n^oo there exists celo such that 


(73 (X‘”’) 

((72 (x''”))^ 

xf 

—— - Cj for] ^ 1, and 

(72(X<">)) 

(72(x‘">)-0. 


( 1 . 12 ) 

(1.13) 

(1.14) 


Now let : 7 ^ 1} be a sequence of independent exponential random variables where 
has rate cj for each j For a fixed A e [R, consider the process 


y^(s) := A5-t^(c7]l{^7 ^ sj-cjs), s^O. 
j 


It turns out that this process is well defined precisely if c e ^, 110 


zero process 


K^(5) := y"(5) - min I/"( 5 '), 


(1.15) 

. Consider the “reflected at 

(1.16) 


and the excursions of V"^(-) from zero. Then Aldous and Limic 10 showed that the lengths of 
these excursions are a.s. in l^ precisely when cef, and thus can be arranged in decreasing 
order. Write 


J^(A):=(:?dA): 1) 


(1.17) 
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for these excursions in decreasing order of their length. Let Zd A): = | Zt (A) | denote the length 
of the ith largest excursion and let 

Z(A) := (ZdA): 1) G f J a.s. (1.18) 

Then Aldous and Limic proved the following result: 

Theorem 1.7 ([^ Proposition 7]). FixAe U and consider the time scale tn := A+ [cr 2 (x‘"’)]“^. 
Under Assumptions ( |1.12| |, ( |1.13K l |1.14h the masses of the connected components of the graph 
^ni^, tn) satisfy 


( 

mass 




A + 


cr2(xf">) 



asn^oo, 


with respect to the topology in whereZiX) is as in | |1.181 l. 


Now consider the connected components in (x, t), and as before, view each component 
^ as a connected metric space via the usual graph distance where each edge has length one. 
Further, view each component as a measured metric space hy assigning mass Xi i mass(S^) 
to each vertex i g Let 5^^ denote the space of (equivalence classes) of measured metric 
spaces equipped with Gromov-weak topology (see Section[2^.1.2|for definition) and view 


M„(A): = 


'^j|a -t 


(72 (x'"’) 




^ 1 


as a random element in Then our next result is about Gromov-weak convergence of 
M„(A). 


Theorem 1.8. Fix A g K. Then under Assumption \l.6\ there exist an appropriate limiting 
sequence of metric spaces M^IX) (Mf(A): i ^ 1) such that 

(T 2 (x‘"')M„(A) ^M^(A), asn^oo. 

Here weak convergence is on which is equipped with the natural product topology in¬ 
duced by the Gromov-weak topology on each coordinate y. 

Remark 1. A full description of the limit objects is given in Section [273| The limit objects use 
tilted versions of inhomogeneous continuum random trees and checking compactness even 
of the original versions at this level of generality turns out to be quite intractable. However 
as the next theorem shows, in the special case of relevance to the rank-one model, one can 
prove much more. 


Consider the special sequence c = c(a,T) := iCiia,T): i ^ 1) e Iq with t g (3,4) and a > 0, 
where 

a 

Ci(a,T) := (1-19) 

Then we have the following result about the limiting metric spaces: 

Theorem 1.9. Fix a > 0, t g (3,4) and let c = c(a, r) as in | |1.19t . Consider the limiting metric 

spacesM^(A) := (M^(A): 1). 

Then almost surely M^iX) is compact for every i ^ 1. Further, the Minkowski dimension of 
Mf(A) satisfies 

dimiM^iX)) = ^ 


a.s. 


( 1 . 20 ) 
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Consequently, theHausdorffdimension satisfies the bounddimhiM^iX)) ^ (t-2)/(t-3) a.s. 


Remark 2. Since we are dealing with equivalence classes of metric spaces (see Sections 2.1.1 


and|2X^, Theorem |1.9| should he understood as claiming the existence of representative 
spaces Mf (A) that are compact, and satisfy the said conditions about the fractal dimensions. 
We will only work with these representative spaces throughout this paper. 


2. Definitions and limit objects 


2.1. Convergence of metric spaces. Proper notions of convergence of (measured) metric 
spaces is one of the central themes in this paper. Here we define the two topologies used in 
the statement of our results. We mainly follow fT|[2^[38)[^ . 


2.1.1. Gromov-Hausdorjf-Prokhorov metric. In this section, all metric spaces under consid¬ 
eration will he compact metric spaces with associated prohahility measures. Let us first 
recall the Gromov-Hausdorff distance den between metric spaces. Fix two metric spaces 
(Xi, di) and {Xz, dz). For a subset CqXix Xz, the distortion of C is defined as 

dis(C) := sup{|di(xi,yi) -d 2 (x 2 ,y 2 )l: (xi,X 2 ), (yi,y 2 ) g C}. 


A correspondence C between Xi and Xz is a measurable subset of Xi x Xz such that for every 
xi G Xi there exists at least one X2 g Xz such that (xi,X2) g C and vice-versa. The Gromov- 
Hausdorff distancebetween the two metric spaces {Xi, di) and {Xz, dz) is defined as 

dcuiXiyXz) - ^inf{dis(C): C is a correspondence between Xi and Xz}. 


Suppose iXi,di) and {Xz,dz) are two metric spaces and pi g Xi, and pz g Xz. Then the 
pointed Gromov-Hausdorff distancebetween Xi{Xi, di, pi) and Xz := {Xz, dz, pz) is given 

by 

d^^iXi,Xz) = ^inf{dis(C): C is a correspondence between Xi and Xz and {pi,pz) g G^.l) 

We will need a metric that also keeps track of associated measures on the corresponding 
spaces. A compact measured metric space {X, d, p) is a compact metric space {X, d) with 
an associated probability measure p on the Borel sigma algebra ^{X). Given two compact 
measured metric spaces iXi,di,pi) and [Xz,dz,pz) and a measure n on the product space 
Xi X Xz, the discrepancy of n with respect to pi and pz is defined as 

D[Ti-,pi,pz):^\\pi-ni\\ + \\pz-nz\\, 


where ni,Tiz are the marginals of tt and || ■ |l denotes the total variation distance between 
probability measures. Then the Gromov-Haussdorf-Prokhorov distance between Xi and Xz 
is defined as 


dcHP iXi,Xz):^ inf < max 


U 


dis(C), Din]pi,pz), n[C^) 


( 2 . 2 ) 


where the infimum is taken over all correspondences C and measures ;r on Ai x Xz. 


Similar to Ha), we can define a “pointed Gromov-Hausdorff-Prokhorov distance", df^ 


GHP 


between two metric measure spaces Xi and Xz having two distinguished points p\ and pz 
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respectively by taking the infimum in I l2.2| | over all correspondences C and measures n on 
Xi X Xz such that (pi, pz) £ C. 

Write y' for the collection of all measured compact metric spaces {X, d, p). The function 
doHP is a pseudo metric on y, and defines an equivalence relation X ~ Y o dcHpi^- T) = 0 
on y. Let y := y I ~ he the space of isometry equivalent classes of measured compact 
metric spaces and Jghp the induced metric. Then by Q, {y, ^ghp) is a complete separable 
metric space. To ease notation, we will continue to use {y, ^ghp) instead of {y, ^ghp) and 
X = {X, d, jX) to denote both the metric space and the corresponding equivalence class. 


2.1.2. Gromov-weak topology. Here we mainly follow 138 . Introduce an equivalence rela¬ 
tion on the space of complete and separable metric spaces that are equipped with a prob¬ 
ability measure on the associated Borel a-algebra by declaring two such spaces {Xi,di,p\) 
and {Xz, dz, pi) to be equivalent when there exists an isometry y/: support(pi) ^ support(p 2 ) 
such that pz - y^*Pi ■- Pi ° i-e., the push-forward of pi under y/ is pz- Write y^ for the 

associated space of equivalence classes. As before, we will often ease notation by not distin¬ 
guishing between a metric space and its equivalence class. 

Fix m ^ 2, and a complete separable metric space {X, d). Then given a collection of points 
x:= (xi,X 2 ,...,x^) e A'”,letD(x) := idiXi,Xj))ij-e[m] denote the symmetric matrix of pairwise 

distances between the collection of points. A function O: y^ —>• K is called a polynomial of 

2 

degree m if there exists a bounded continuous function cj): U'l ^ [R such that 


0((A,d,p)):= j d>(D(x))p®'”(d(x)). 


(2.3) 


Here p®'” is the m-fold product measure of p. Let n denote the space of all polynomials on 


Definition 2.1 (Gromov-weak topology). A sequence {Xn,dn,Pn)n^\^y* is said to converge 
to {X, d, p) e y^ in the Gromov-weak topology if and only j/0((A'„, dn,Pn)) ‘1^((A', d, p)) for 
all (hen. 

In Theorem 1] it is shown that y* is a Polish space under the Gromov-weak topology. 
It is also shown that, in fact, this topology can be completely metrized using the so-called 
Gromov-Prokhorov metric. 


2.1.3. Spaces of trees with edge lengths, leaf weights and root-to-leaf measures. In the proof of 
the main results we need the following two spaces built on top of the space of discrete trees. 
The first space T// was formulated in |[^[^ where it was used to study trees spanning a 
finite number of random points sampled from an inhomogeneous continuum random tree 
(as described in the next section). We use the same notation in this paper. 

The space T//: Fix / ^ 0 and / ^ 1. Let T// be the space of trees having the following proper¬ 
ties: 


(a) There are exactly / leaves labeled l-t,...,/-i-, and the tree is rooted at another labeled 
vertex 0-t. 

(b) There may be extra labeled vertices (called hubs) with distinct labels in {1,2./}. (It is 

possible that only some, and not all labels in {1,2,..., /} are used.) 

(c) Every edge e has a strictly positive edge length f. 
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A tree t e T// can be viewed as being composed of two parts: 

(1) shape(t) describing the shape of the tree (including the labels of leaves and hubs) but 

ignoring edge lengths. The set of all possible shapes is obviously finite for fixed I, J. 

(2) The edge lengths l(t) := {le'.ee t). Consider the product topology on T// consisting of the 

discrete topology on and the product topology on K"* where m is the number of edges 

oft. 

The space T*^: We will need a slightly more general space. Along with the three attributes 
above in T//, the trees in this space have the following two additional properties. Let if (t) := 
{!+,...,/+} denote the collection of non-root leaves in t. Then every leaf u e if (t) has the 
following attributes: 

(d) Leaf weights: A strictly positive number Aiv). Write A(t) := (Aiv): v e i?(t)). 

(e) Root-to-leaf measures: A probability measure Vt,t, on the path [O-t, v] connecting the 
root and the leaf v. Here the path is viewed as a line segment pointed at O-t and has the 
usual Euclidean topology. Write v(t) := (vt,y : u e for this collection of probability 
measures. 

In addition to the topology on T//, the space T*^ with these additional two attributes inher¬ 
its the product topology on IR-^ owing to leaf weights and (dQHp)^ owing to the root-to-leaf 
measures. 

For consistency, we add to the spaces T// and T*^ a conventional state d. Its use will be 
clear later on. 

2.2. Random p-trees and inhomogeneous continuum random trees (ICRTs). For fixed 
m ^ 1, write Jm and for the collection of all rooted trees with vertex set [m] and rooted 

ordered trees with vertex set [m] respectively. Here we will view a rooted tree as being di¬ 
rected with the root being the original progenitor and each edge being directed from child 
to parent. An ordered rooted tree is a tree where children of each individual are assigned an 
order (meant to describe for example orientation in a planar embedding, e.g., right to left or 
some notion of age, e.g., oldest to youngest). 

In this section, we define a family of random tree models called p-trees |^|^, and their 
corresponding limits, the so-called inhomogeneous continuum random trees, which play 
a key role in describing the limit metric spaces as well as in the proof. Fix m ^ 1, and a 

probability mass function p = {pi,p 2 . Pm) with pt > 0 for all i e [m]. A p-tree is a random 

tree in with law as follows. For any fixed t e Jm and vet, write dpit) for the number of 
children of v in the tree t. Then the law of the p-tree, denoted by IP tree- is defined as: 

Ptree (t) = Ptree (t; p) = Pu'^ , teJm- (2.4) 

ve[m] 

Generating a random p-tree S' ~ Ptree and then assigning a uniform random order on the 
children of every vertex v eS' gives a random element with law Pordl'l p) given by 

„4y(t) 

Pord(t)=Pord(t:p)= n -SnJWV ^2.5) 

Obviously a p-tree can be constructed by first generating an ordered p-tree with the above 
distribution and then forgetting about the order. 
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In a series of papers pTfjl^ it was shown that p-trees, under various assumptions, con¬ 
verge to inhomogeneous continuum random trees that we now describe. Recall the space f | 
in Consider the subset 0 c given by 


0:={0:=(0,-: 1) G ^ 0. = oo, 

^ 1=1 



( 2 . 6 ) 


Now recall from that a real tree is a metric space d) that satisfies the following 

for every pair a,be ST: 


(a) There is a unique isometric map fa,b'- [0,d(a,fi)] ^ S' such that fa,bib) - 
a, fa,bid{a,b)) = b. 

(b) For any continuous one-to-one map g : [0,1] ^ with g(0) = a and g(l) = b, we have 

g([0,l]) = fa,bi[b,dia, b)]). 


Construction of the ICRT: Given 0 g 0, we will now define the inhomogeneous continuum 
random tree We mainly follow the notation in |j^ . Assume that we are working on a 
probability space Pg) rich enough to support the following: 


(a) For each i ^ 1, let 5^/ := (^j,i,^/, 2 ,..-) be a rate 6i Poisson process, independent for dif¬ 
ferent i. The first point of each process ^i,i is special and is called ajoinpoint, whilst the 
remaining points ^i,j with j '^2 will be called i-cutpoints 113 . 


(b) Independent of the above, let U - [Uj : j ^ 1, 1) be a collection of i.i.d. uniform (0,1) 

random variables. These are not required to construct the tree but will be used to define 
a certain function on the tree. 


The random real tree (with marked vertices) is then constructed as follows: 


(i) Arrange the cutpoints {^ij : i^lj^ 2} in increasing order as 0 < iji < rjz < ■"■ The 
assumption that Y.i 9^ <oo implies that this is possible. For every cutpoint 0^ = ^i,j, let 
Vt ■- ^i,i he the corresponding joinpoint. 

(ii) Next, build the tree inductively. Start with the branch [ 6 , 771 ]. Inductively assuming we 
have completed step k, attach the branch {.rjk,Vk+i] to the joinpoint 77 * corresponding 
to 77 A:. 


Write for the corresponding tree after 


one has used up all the branches 
3 1, the joinpoint corresponds to a 
The ICRT is the completion of the 
Section 2], this is a real-tree as defined above 
which can be viewed as rooted at the vertex corresponding to zero. We call the vertex 
corresponding to joinpoint hub i. Since Y.i 9i = 00 , one can check that hubs are almost 
everywhere dense on 


[0,771],{(77fc,77A:+i] Note that for every i 

vertex with infinite degree. Label this vertex i. 
marked metric tree 3'^. As argued in 


13 


Remark 3. The uniform random variables (17“ : 7 ^ 1, 7 ^ 1 ) give rise to a natural ordering 
on .Tjf, (or a planar embedding of .Tj®,) as follows. For 7 ^ 1 , let {3'p : 7 ^ 1) be the collection 
of subtrees hanging off of the 7 th hub. Associate Uj^ with the subtree S'l‘\ and think of .7^.“ 
appearing “to the right of" 3'.^^ if < Uj\ This is the natural ordering on 3'^^ when it is 

being viewed as a limit of ordered p-trees. We can think of the pair {3'^y U) as the ordered 
ICRT. 
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Figure 2.1. An illustration of the ICRT construction with four point process 
: 1 ^ I ^ 4}. The red points represent the joinpoint of the corresponding 
point process and the blue points the corresponding cutpoints. The last line 
contains the union of the four point processes. See Figure [2^ for the corre¬ 
sponding tree. 



Figure 2.2. The tree constructed via the stick-hreaking construction from 
Figure [27T| 


Reduced tree Fix 1^0 and / ^ 1. Now let rjo - 0 and for j ^ 0 call vertex pj the jth 
sampled leaf and label this as j+ to differentiate this from hub j. Note that the subtree of 

spanned by {0+, 1+./+} (namely the part of the tree constructed from the interval 

[0,77/]) is a tree in the usual sense with random edge lengths. For all hubs i, if i ^ I, retain its 
label and remove the label otherwise. This gives a random element of T// (recall the definiton 
Section 2.1.31, which we denote by rl'J’. See Figure |2.3| corresponding to the stick-breaking 
construction in Figures[0]and[2l2| 
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Figure 2.3. Reduced tree corresponding to the tree in Figure [2^ 


Mass measure: For every vertex u e define the degree of v to he the number of con¬ 
nected components of \{v}. Vertices with degree one are called leaves of and all 
other vertices form the skeleton of the tree. Let denote the set of leaves of 3'^^y In 


113 , it was shown that one can associate to 3',^y a natural prohahility measure called the 


mass measure satisfying = 1 . 

Root-to-vertex path measures: Now using the collection of uniform random variables 
above, we will define a function 0 (^) on the tree as well as a collection of measures on paths 
emanating from the root. Recall that the hubs in have infinite degrees. Let {3'P : j 

be the collection of subtrees of hub i in (labeled in some fashion). For each y e ST^y let 


<5(cc)(y) = 


^ x]l{yG5-/>} 


(2.7) 


We will show in our proof that (S(c»)^) is finite for almos t every realization of and for /i- 
almost every y e 3’^-^ (see Lemma 4.9 and Theorem 4.15 below). For y e 3'^y let [p, y] denote 


the path from the root p to y. For every y, define a probability measure on [p, y] as 




diUf 


if V is the ith hub and y e 3’-'' for some j. 


( 2 . 8 ) 


25,«„(y) 

Thus, this probability measure is concentrated on the hubs on the path from y to the root. 

Remark 4. Note that both 0(^,(O andQ^’(-) depend on the realization of the pair f/), 
but we chose to suppress them to avoid cumbersome notation. 


jj. ixxv, uxww , jj above. Recall that rjj is the vertex in the tree 


cor- 


Random tree Recall the tree 
responding to leaf 7 + for 1 ^ 7 ^ /. To each of these / leaves, associate the value (5^^) {rjj), and 
associate the probability measure to the path [0+, 7 +]. This tree is a random element of 


the space T*^ (see Section 2.1.3|, which we denote by ^ 


^(00) 

'ij • 
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2.3. Continuum limits of components. The aim of this section is to give an explicit descrip¬ 
tion of the limiting (random) metric spaces in Theorem|1.8[ We start hy constructing a spe¬ 
cific tilted version of the ICRT in Section l2.3.11 Then in Section l2.3.2l we describe the limits 
of maximal components. 


2.3.1. TiltedICRTsand vertex identification. Let (n.,^,Pg) and.J^®, he as in Section2.2 and 
let 7 > 0 a constant. Informally, the construction goes as follows: We will first tilt the distri¬ 
bution of the original ICRT using the functional 


:=exp 



(5i^fy)lJ.idy) 


(2.9) 


f ) ir 'ir 

to get a tilted tree . We then generate a random but finite number iV,^, of pairs of points 
yk).l^k^ The final metric space is obtained by creating “shortcuts" by identi¬ 
fying the points Xk and y^. Formally the construction proceeds in four steps: 

(a) Tilted ICRT: Define Pg on D by 

dP* exp 0(00) (y)p(dy)) 

dPg E[exp(7/_^g_5-e0(^)(x)/i(dx))] 

The expectation in the denominator is with respect to the original measure Pg. In our 
proof we will show that this object is finite. Write (^f;*,/i*) and U* = : i,j ^ 1) 

for the tree and the mass measure on it, and the associated random variables under this 
change of measure. 

(b) Poisson number of identification points: Conditionally on generate 

, having a Poisson (A^j) distribution, where 


A •= 


t/ e. 


0(oo)(y))U*(dy) 


!>1 




(c) 

(d) 


Here, ^ 1) denotes the collection of subtrees of hub i in (As mentioned 

before in Remark 4 0(oo) (0 depends on the realization of the ordered ICRT. appears 

in the expression above as the function 0(^) acts on y g for which the associated 
order is described by U*.) 

0 ★ 

“First” endpoints (of shortcuts): Conditionally on (a) and (b), sample Xk from 
with density proportional to 0(oo)(x))U*(dx) for 1 ^ A: ^ Ai,^,. 

“Second” endpoints (of shortcuts) and identification: Having chosen Xk, choose yk 
from the path [p, Xk] joining the root p and Xk according to the probability measure 


as in | |2.8| but with replacing (7“. (Note that yk is always a hub on [p, Xk].) Identify 
Xk and yk, i.e., form the quotient space by introducing the equivalence relation Xk ~ yk 
forl^k^N*,. 


Definition 2.2. Fixy^ 0 andff £ 0 as in ( |2.61 l. Lef^ooiOyj) be the metric measure space con¬ 
structed via the four steps above equipped with the measure inherited from the mass measure 
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In our proofs, we will always think of the leaf end (of a shortcut or a surplus edge) as the first 
endpoint, and the second endpoint will he selected from the skeleton. 


2.3.2. Limits of the components. Fbc A g IR and c g Zq as in I jl.lll l and consider the setting of 
Theorem |1.8[ We will need 2 main objects: 

(a) The process V^i-) in dl.lGl l. Recall that the excursions of this process from zero could he 
arranged in increasing order of lengths as (A). Let E'"’ = {cj : g J?/) denote the point 

process of jumps of the process corresponding to the excursion J?;(A). Abusing 
notation we will write = {cj : j g J?;). 


(b) The actual lengths of these excursions (Z/(A ): i^l) as in dl.lS l. 

From these objects, for each fixed Z ^ 1, define the random variable y'”’ and the point process 
= [Of : j G as 


- ZdA). 


r 




t 

/ E 

0® := 

-.jeZiU) 

^ ve2i[X) 




( 2 . 10 ) 


Our proof (see Proposition|5.l) will imply that 0'"’ g 0 as in d2.61 l a.s. Define 


- 1/2 


FdA):=ZdA) Y. 


\ve2im I 

and generate the random metric measure spaces 

MDA):=FdA)-<;^oo(0®,f‘‘’), 


where ‘^ooiO, 7) is as described in Section 2.3.1 and the metric spaces are conditionally inde¬ 
pendent across i given the driving parameters in d2.101 l. Let M^(A) = (Mf(A): Z ^ 1). Then 
this is the limiting collection of metric spaces in Theorem |1.8[ 

To describe the sequence of spaces M™(A) appearing in Theorem|1.2[ define 


c”:=(c” :;>!), 


where 




1 

EW I 7 J 


( 2 . 11 ) 







-1 U2/F-1) Z2/(t-1) 


and 


Here W is a random variable with distribution F as in dl.31 l. Then 

1 


M“(A) = 

“ EW 


m- 




(A + O 
EW 


( 2 . 12 ) 


(2.13) 


3. Discussion 

We describe the two major motivations for developing the general theory of this paper in 
Sections |3A] and [372] In Sections |3^ and |3^ we include a brief discussion about ICRTs as 
well as give an overview of the order in which the proofs are carried out. 
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3.1. Universality and domains of attraction of critical random graph models. One natural 
question the reader might ask at this point is why the general theory in Section [L2| why not 
just stick to the rank-one random graph model as in Section [LT| As we have described in the 
introduction, the aim of this paper is the development of general theory applicable to a wide 
array of models. What does one mean by this? It turns out that many different random graph 
models can be constructed in a dynamic fashion as a graph-valued process t ^ 0} 

where edges are added as time advances thus resulting in mergers of components as t ] 
tc- In this construction, there is a critical time tc (model-dependent) such that the giant 
component emerges after time tc- 

Now for most random graph models (including the configuration model) the dynamics of 
mergers of components starting at time zero do not look like the multiplicative coalescent. 
However if one were to zoom in at the critical time tc, for many models, there exists £„ | 0 
such that if one were to look at the interval [tc - £«, tc + En\, then mergers of components 
can be approximated by the multiplicative coalescent. Here tc - Sn often corresponds to the 
barely subcritical regime of the random graph. Thus if one had good control over component 
functionals at the barely subcritical time tc-Sn and in particular if one was able to show that 
component sizes appropriately normalized satisfied Assumption ] 1.6[ then one can use The- 
orem jl.Sj to derive convergence at the critical time tc of the maximal components. Note that 
one does not expect component sizes at time tc - £« to satisfy assumptions of the Norros- 
Reittu model in ( |1.4] i. Rather in most cases, at time tc - £«, the expected size of the compo¬ 
nent of a randomly selected vertex would scale like while the maximal component 
would scale like (ignoring logarithmic corrections) where 5i < 62 are related to various 
scaling exponents of the system. In work in progress , Theorem |1.9| coupled with delicate 
estimates of various scaling exponents for the configuration model in the barely subcritical 
regime, proves analogous results for the configuration model with degree exponent t e (3,4). 
Sizes of maximal components in the critical regime including the heavy-tailed regime for this 
model was previously analyzed in ||^. Further as was done in ||T^, where a number of suf¬ 
ficient conditions for the domain of attraction of the critical Erdos-Renyi scaling limits were 
derived, we hope to derive similar general conditions for a random graph model to belong 
to the same domain of attraction as the rank-one model with t e (3,4), established in this 
paper. 


3.2. Minimal spanning tree on inhomogeneous random graphs. As described in the intro¬ 
duction, a second major motivation for the technical analysis in this paper is the minimal 
spanning tree. To fix ideas, consider the Norros-Reittu model in the supercritical regime 
(the parameter in | |1.51 l v > 1). To each edge attach a random edge weight i.i.d. across edges, 
assumed to be derived from a continuous distribution. Consider the minimal spanning tree 
(MST) of the giant component. A large amount of simulation-based evidence from statistical 
physics suggests that when the degree exponent t e (3,4) then the distances 

in this object scale like same distance scaling shown in this paper for the 

maximal components in the critical regime (Theorem ] 1.2| . 

This is not a coincidence. As has been shown in a series of fundamental papers |3]-[^ for 
the complete graph and the supercritical Erdos-Renyi random graph, a major ingredient in 
the analysis of the MST problem is the scaling of maximal components in the critical regime 
which then provides crucial input for the scaling limit of the MST. Till date we have no rigor¬ 
ous results for the scaling of the MST on any “inhomogeneous” random graph model. This 
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Figure 3.1. On the left, an approximation of an ICRT (using p-trees on ap¬ 
proximately 20000 vertices) corresponding to di oc j - 3 01 . 

The reason behind this choice of 0, is explained in Section]^ On the right, an 
approximation of a Brownian CRT (using a uniform random tree on the same 
number of vertices). Vertex sizes are proportional to the degree of the vertex. 


paper provides the first step in answering this question in the heavy-tailed regime. Further 
this program should enable one to analyze the MST for random graph models other than the 
rank-one model which belong to the same “domain of attraction” in the critical regime. 

3.3. Inhomogeneous continuum random trees. As evident from Section [2l2| ICRTs play a 
major role in the description of our limiting objects. Despite a lot of work on these objects in 
the last decade fTTj[T3|[27) , a number of questions regarding these continuum objects are still 
open, ranging from sufficient conditions for compactness to the dependence of the fractal 
properties of this object on the driving parameter 0. Our proof shows that in some special 
cases, ICRTs are compact metric spaces when 0 is sampled according to an appropriate size- 
biased distribution. This can be seen as an annealed result on compactness of the ICRT. 
Whether compactness is true for non-random sequences 0 e 0 has been open problem for 
more than a decade |^. Similar questions hold for its fractal dimensions. See Sectionj^for 
a more detailed account of these problems. 


3.4. Overview of the proof. In Section|^ we study the random graph „ (x, t) as in Definition 


1.4 We start with the simple observation that conditional on the vertex set of components of 


(x, t ), a fixed component has the same distribution as '^n (x, t) conditional on being con¬ 
nected. This section studies asymptotics for such distributions assuming specific regularity 
properties of vertex weights in the component in the large network limit, showing Gromov- 
weak convergence of the associated graph under proper normalization of edge lengths and 
vertex weights. Sectionj^uses the size-biased exploration of the process t) to show 
that maximal connected components satisfy the hypothesis required in Section 4 Section 
[^studies the special entrance boundary in ( |1.19| l proving both compactness of the limiting 
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objects as well as strengthening the convergence in the Gromov-weak topology to conver¬ 
gence in doHP- In Section]^ we derive the box-counting or Minkowski dimension. In Section 
1^ we conclude by describing a number of open problems. 

4. Proofs: Asymptotics conditional on being connected 

The aim of this Section is to study large connected components of (x, t) assuming vertex 

weights satisfy a few regularity properties. 

4.1. Tilted p-trees and connected components of t). Recall the random graph ^^(x, t) 
from Definition 1 1.4[ Here for any f ^ 0, i 3^ 1) denotes the components in decreas¬ 

ing order of their mass sizes. In this section we will describe results from ||^ which gave a 
method of constructing connected components of ^^(x, t) conditional on the vertices of the 
components. This construction involved tilted versions of p-trees introduced in Section [Z^ 
Since these trees are parametrized via a driving probability mass function (pmf) p, it will 
be easy to parametrize various random graph constructions in terms of pmfs as opposed to 
vertex weights x. Proposition |4.1| will relate vertex weights to pmfs. 

Fix n ^ 1 and T [n] and write for the space of all simple connected graphs with 
vertex set 7. Forfixed a > 0, and probability mass function p = [p^: veT), define probability 
distributions IPcon(':P> on G^"^ as follows: Define for i, j e T, 

qij :^l-exp[-apipj). (4.1) 

Then 

PconiG-,p,a,y):^ / Y[ qij fl (1 -for Gg G“", (4.2) 

(LPeRIG) iiJHEiG) 

where Z(p, a) is the normalizing constant 

Z(p,a):= Y. n ^ij n II 

GeG“" ii,j)eE[G] [iJUEiG] 

Nowlet7‘‘’ := I/('^^;(f)) be the vertex set of for i ^ 1 and note that {Z'’’: i ^ ifdenotes 

a random finite partition of the full vertex set [n]. The following result is obvious from the 
construction of ^^(x, t) : 

Proposition 4.1 (Proposition 6.1]). Conditional on the partition : i^\] define 

For each fixed i^ I, letGi e G“jj be a connected simple graph with vertex setT^^. Then 

p[^i{t) = Gi, 1 I i ^ 1}) = n Pcon(G/;p‘^\a® 7®). 

;>1 

Thus the random graph ^^(x, t) can be generated in two stages: 

(i) Stage I: Generate the partition of the vertices into different components, i.e., generate 

{y®: 1}. 

(ii) Stage II: Conditional on the partition, generate the internal structure of each compo¬ 
nent following the law of IPcon(-:p‘‘\<3!'‘’,Z‘'’), independently across different compo¬ 
nents. 
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Let us now describe an algorithm to generate such connected components using distribu¬ 
tion I 14.21 I. To ease notation, let Y -[m] for some m^\ and fix a probability mass function p 
on [m] and a constant a > 0 and write PconiO := IPcon(-:p> o., [ wD on := G™*!. We will first 
need to set up some notation before describing this result. 

Depth-first exploration of ordered trees. Recall that we used for the space of ordered 
(or planar) trees with vertex set [ m]. Given a tree t g one can use the associated order to 

explore the tree in a depth-first manner. More precisely we start with i'(l) being the root of t. 
At each stage 1 ^ i ^ m, we will keep track of three types of vertices: the set of active vertices- 
sd{i), the set of explored vertices-^(z), and the set of unexplored vertices-'%'(z). The set of 
active vertices will in fact be viewed as a vertical stack (not just a set) with si (z) representing 
the state of this stack at the end of step si (z). Initialize the process with si (1) = {zz(l)} (the 
root oft), ^(1) = 0 and^(l) = [m] \ {zz(l)}. At step z ^ 1, we let 

(i) v{i) denote the vertex at the top of the stack si (z) and let @(z) c'%'(/) denote the set of 
children of zz(z). Delete zz(z) horn si{i) and arrange the vertices of @(z) from oldest to 
youngest at the top of the stack to form sd (z + 1); 

(ii) ^(z + l)=(9(z)u{z;(z)}: 

(iii) ^(z + 1) ='^(z)\@(z). 

Write ^(t) for set of pairs of vertices {u, v} such that u,vesi (z) for some 1 ^ z ^ zn; namely 
both vertices are active but have not yet been explored. Using terminology from 0 , call this 
collection the set of permitted edges. Thus, 

^(t) := {{vii), u) \ 2 ^ i ^ m, uesi{i-\)\ {zz(z)}}. (4.3) 

Write flit) for the edge set of t. Now define the function L : ^ K+ by 


L(t) = L„„,(t):= n 

[k,e]EE(t) 


explapkpe) -1 
apkPe 


ord 


(4.4) 


Recall the (ordered) p-tree distribution from 
in the distribution 

] (t) : = Pord (t) ' 


exp Y. (^PkpA, teT 

\[k,e)eggi) ) 

. Using L(-) to tilt this distribution results 


Lit) 


tGT 


ord 


ord-- “oro.^. Eord[f^(=^^)]’ 

For future reference we fix notation for the various objects required in the proof below. 


(4.5) 


Definition 4.2. Fix m ^ I, a > 0, and a probability mass function p on [m]. We will write 
‘Srnip,d) to denote a random graph with distribution Pcon(->P><2> [m]). will denote a 

random planar tree with the tiltedp-tree distribution | |4-5| l, andST^ will denote a random tree 
with the original p tree distribution | |2.5| l . 


Proposition 4.3 (||^ Proposition 7.4]). Fix m^\, a probability mass function p on [m], and 
a > 0. Consider a random connected graph on [m] constructed as follows: 

(a) First generate a rooted planar random tree with distribution IPord(-) as in l |4-51 l . 

(b) Let'^{3'^’^) denote the permitted edge set of this random tree. Add each such edge{u, v} e 

with probability quv as in ( |4.U , independent across permitted edges. 

Then, the resulting random graph has distribution Peon on i.e, has the same distribution 
as^mip,a). 
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4.2. Convergence of connected components under weight assumptions. The aim of this 
section is to prove Gromov-weak convergence for the connected graph < 2 ) under reg¬ 
ularity conditions on a and p as m ^ 00 . We will assume that we have ordered the index set 
[m] so that pi^ Pm>0. Let 

Assumption 4.4. As 00 , the following hold: 

(i) cr(p) ^ 0 and further for each fixed i ^ 1 , pi/aip) Oi whereO := { 01 , 62 ,...) is an ele¬ 
ment of 0 as in ( | 2 . 6 | l . 

(ii) There is a constant j > 0 such that acr(p) ^ 7 . 

The following theorem is the main result of this section. 

Theorem 4.5. Consider the connected random graph ‘Smip.d) viewed as a metric measure 
space via the graph distance where each vertex v is assigned measure pv Under Assumption 

cr(p)^l;„(p, a] -^^oo{0,r)’ 

where ‘^oo {6, 7 ) is the random metric space defined in Definition \2.2\ and convergence is in the 
Gromov-weak topology on metric spaces. 

The rest of this section proves this result. We will throughout assume that has 

been constructed using Proposition |4.3[ 

4.2.1. Two constructions of p-trees: Exploration process and the birthday construction. We 
start by describing an explicit construction of the (untilted) p-tree ST^ first developed in . 
At the end of this section we describe a second construction used later in the paper. 
Exploration process construction: The first construction is initiated by setting up a map 
y/p : [0, l]’^ ^ as follows. Let u := {u^ : v e [m]) be a collection of distinct points in (0,1). 
Define 

m 

F^{s):=-s-tY, pi,l{Ui,^s}, 5e[0,l]. 

Assume that there exists a unique point v* e [m] such that F^{Ui,*-) = min^e[o,i]FP( 5 ). Set 
V* to be the root of the tree xf/p{u). Define y, := Ui - u^* mod 1 for i e [m], and 

F“‘'’P(s) FP(Uy* + s mod 1 ) - FP(u^* -), 0 ^ s < 1 . 

Then F®^^'P (1 -) = 0 and F^’^^^’P ( 5 ) > 0 for s g [0,1). Extend the definition of F^^^^’P to 5 g [0,1] by 
define F^’^^^’Pil) = 0. We use f®^‘^’P to construct a depth-first-search of an ordered tree whose 
exploration in this depth-first manner is encoded by the function f®^^'P. This in turn defines 
the tree y/p{u). As before, in this construction we carry along a set of explored vertices 0{i), 
active vertices .£ 2 /(i) and unexplored vertices'%'(/) = [m]\(.B/(i)u@’(i)),for0 ^i ^m. We view 
si{i) as the state of a vertical stack ^ after the ith step in the depth-first-search. Initialize 
with 0’(O) = 0, siiO) = {t'*}, '%'(0) = [m] \ (i'll)}, and define y*(0) = 0. At step i g [m], let 
v{i) be the value that is on the top of the stack sd{i - 1 ) and define y* {i) := y* {i - 1 ) + PvU)- 
Define S>(i) := {j g [m]: y* {i - 1) < yj < y* (i)}. Suppose @(i) = {u{j) j ^k} where we 
have ordered these vertices in the sequence that they are found in this interval, i.e., 

y* {i - 1) < y„(i) < ... < yuik) < y* Ul 
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Update the stack si as follows: 

(i) Delete v{i) from^. 

(ii) Push uij), I ^ j ^ k, to the top of sd sequentially (so that uik) will he on the top of the 
stack at the end). 

Let si ii) he the state of the stack after the above operations. Update 0ii)©{i - 1) u {vii)} 
and (i - 1) \ ® (i). See Figure [4U]for a pictorial description of this construction. 



Uq Uq U3 U5 Ui U2 U4 U-j 

\-Z-^ - O - C — 3 - C 3 - 

->- > -> — > — > - > - > - > 

P4 Pi P3 P5 P8 Pi Pg Pi 



Figure 4.1. The function FP and the corresponding tree i/Ap. 


The tree y/piu] e is constructed hy putting the edges {(i'(z), v):i e [m], v e Q>ii)} and 
using the order prescribed in the above exploration to make the tree an ordered tree. The 
fact that this procedure actually produces a tree is proved in . 


Lemma4.6 Section 3.2]). Consider the map if/p. LetX:- {Xp : v e [m]) be Ltd. random 
variables distributed uniformly on (0,1). Then the random tree if/pOQ has distribution | |2.5| l, 

i.e., i/Ap(X) = 5-P. 
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For future reference, coupled with the above construction, define 5^ii)sd{i - 1) \ {vii)} 
for i e[m]. Define the function Ami-) on [0,1] via 

Amiu):^ ^ Pv, for ueiy*ii-l),y*ii)],ie[m]. (4.6) 


Further let Amiu) := aAmiu), u e [0,1], where a is the scaling constant in | |4-l| l- 
Birthday construction: We now describe a second construction of p-trees, first formulated 
in . We urge the reader to skim this portion and return to it once she has reached Section 

Let Y := (Yo> Yi,...) be an infinite sequence of Ltd. random variables with distribution p. 


4.5 


Let Rq-0 and for Z ^ 1, let i?/ denote the Z-th repeat time, i.e.. 


=min|A:> i?/_i : g {Yo,-.-, Yfc_i}|. 


Now consider the directed graph formed via the edges 

5-(Y) := Yj) :Y,t{Yo . Yj-t}.js\}. 

It is easy to check that this gives a tree which we view as rooted at Yq. Intuitively the process 
of constructing a tree is as follows: the tree “grows” via the addition of new vertices sampled 
using p till it stumbles across a “repeat” (a vertex already found) when it goes back to the 
first occurrence of this “repeat” and starts growing from that position. The following striking 
result was shown in . 


Theorem 4.7 ((^ Lemma 1 and Theorem 2]). The random tree ^(Y) viewed as an object in 
Jm is distributed as a p-tree with distribution \2A) independently o/Yrj-i, which 

are i.i.d. with distribution p. 


Remark 5. The independence between the sequence Yr^-i,Yr^-i,... and the constructed p 
tree cTCY) is truly remarkable. In particular, suppose ^ is a p-tree with distribution as in 
( |2.4| l and for fixed r ^ 1, let Yi,Y 2 ,... Yy be i.i.d. with distribution p. Write for the tree 

spanned by these vertices and the root. Let c ,^(Y) denote the subtree with vertex set 

{ Yo, Yi .namely the tree constructed in the first Ry steps. Here ^ is a mnemonic for 

“birthday tree” and also to distinguish this construction from a generic random tree model 
with r vertices. Then the above result (formalized as Corollary 3]) implies that these can 
be jointly constructed as 

(Yi, Y2, ..., Yy-,^y) = (Yfl,_i, Yr,.,, ... YR^.p,3rf). (4.7) 

We use this fact often in Section lT^ 


4.3. Uniform integrability of the tilt. The first use of the above construction of the p-tree is 
to prove the following: 

Proposition 4.8. Fix s^ \ and consider the tilt Lb) as in ^AA) . Under Assumptions \AA\ there 
is a constant K := Kis) <oo such that 

sup Ey,y4[Liir^)Y]^K. 

m>l 

In particular, the collection of random variables {LitTm ): m ^ 1} is uniformly integrable. 
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Proof: Writing out the tilt L(-) explicitly, we have 

exTpiapkPe) -1 


n 

ik,e)eE{t) 


apkPe 


expf Y. apkpA^I(t)Ut), 


say, where. 


_ exp(ap;P/)-l I _ -V 

I(t):= [[ -^expla 2^ pipjl ^ expiapi). 

[i,j)eE(t] <^PiPi '■ [i,j)eE{t] 


m) 


(4.9) 


Here we have used (e-^ - l)/x ^ e^ for x > 0 for the first inequality and the second inequality 
follows using the fact that t is a tree, so that for each {i, j) e Eft) such that i is the parent of 
j, we have pipj ^ PiPj- By Assumption [O} we have api jOi. In particular, there is a 
constant C > 0 such that for all m ^ 1, and t g 

I(t) ^ C and L(t) ^ Cexpf Y ‘^PkpA- (4.10) 

Now recall the functions Am and Am a Am from | |4.6| |. Using the equivalent characteriza¬ 
tion of the permitted edge set from | |4.31 and comparing this with | |4.61 l, it is easy to check 
that 

Y apiPj^aY L PiPj^f ^mis)ds. 


Now hy the definition of 


By M, 


Thus 


T'''xC’P(y*(j)) = Y Pv, for ie[m]. 

vesi[i) 

Am{t)= Y Pv^ Y Pv-Pv{i), forte(y*(f-l),y*(f)]. 

veS^ii) vesi^(i-l) 


(4.11) 




(4.12) 


By Assumption [4i4) (ii) and H.IOL for any 5^0, there exists K = Kis) <oo such that 

_||^exc,p||^ 


[KJ-Pij'^C^exp 


K- 

V o-(p) 


Now the following lemma completes the proof of Proposition |4.8[ 

Lemma 4.9. There exists a positive constant c> 0 such that for every m^\ and x ^ e. 


IF-^^’PlIoo^ 


xcr(p)) ^ exp (- cxlog(logx)). 


Proof: Write ^(m) := ||F®xC’P||^/o-(p) and as before, letX= : v e [m]) he the collection of 
uniform random variables used to construct FP. Write Q[0,1] for the set of rationals in [0,1]. 
Then note that 

FP(fl) FP(fl) 

S^{m)= sup-inf -:= ^i(m)-t ^ 2 (w). 

^?eQ[0,l] (T(p) qmo,l] (7(p) 


(4.13) 
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We start by analyzing (m). For fixed q g Q[0, 1], define the collection of m functions 

Pi 




J . 


Note that for all j g [m], : [0,1] ^ [-1,1], with Eis^iXj)) - 0 and further 

^i(m)= sup [5^(Xi) + --- + s^(X^)]. 

Also note that 

sup Var|s^(Ai) + --- + s^(A^)j = sup 


qe(Q[0,l] 

If we can show that 


qeQ[0,l] 


K supE(^i(m)) < cxa, 

m>l 


(4.14) 


then standard concentration inequalities for the maxima in empirical processes Theo¬ 
rem 1.1(b)] will imply the existence of a constant Ci > 0 such that for all m 3^ 1 and x > 0, 


P(^i(m) ^ E(^i(m)) -t x) ^ exp --log 


l + 2log 1 + 




(4.15) 


Let us now prove l |4.14h In fact we will show the stronger result: 


supE 

m>l 


sup 

^^ye(Q)[0,l] 


Z 


7 = 1 


< oo. 


Let X(i) < A( 2 ) < • • • < denote the order statistics of X and let n denote the corresponding 
permutation of [m], namely X,,, = X^^). Note that 


sup 

^/eQIO.l] 


m 

Z 

7 = 1 


:= max \di\, where di\- 


X{i] + = l P^ij) 

crip) 


Hence 


maxjdjl ^ max [cr(p)] 

ie[m] i£[m] 


-1 


i 

-Xq, + - 

+ max[cr(p)] ^ 

tpnip-^ 

m 

ie[m] 

7 = 1 ^ 


:= ^ii(m) -f .^i2(m). 

We first analyze (m). By the DKW inequality 

i 


max 

ie[m] 


-Xq,+ 


m 


aipix ^ 2exp|-2m- (cr(p)x)^j 


By Cauchy-Schwartz, mcr^(p) ^ (£/ pi)^ - 1. Thus sup^^^ E(^ii(m)) < oo. We now analyze 
^12 (m). Since 

, 1 . , Zk^ee[m]PkPe l-cr2(p) .,.^r , 

Eip„(i)) = —, and Eip„^i)Pn{j)) ^^--— = —-- for i^]e[m\, 

■' m m(m-l) m(m-l) 

for any ie[m\ we have 


E 


E Pnii) 
VV7=1 ^ 


ia^ ip) iii-l) 


+ 


m 


mim - 1) 


(l-a^(p))- —^-C7"(p) 


m 


(4.16) 
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by simply expanding the square. Now note that since tt is a uniform random permutation of 
the vertex set [m], for any fixed i ^ 1 we also have 


y=i 


i d 

- Z- Pn(m-i) 
^ j=0 


i 

m 




m-i 

m 


m-i 

'y, Pnii) 
i=i 


Thus 


E(^i 2 (m))^ 2 E max [cr(p)] ^ 

1 ie[ml2] 


7 = 1 



(4.17) 


Now assuming that we construct n by sequentially sampling without replacement from [m], 
let ^jc denote the a-field generated by {Ti{\),n{2),nik)) for 0 ^ A: ^ m - 1. Let Mq = 0 and 
consider the sequence 


Mic'.- -, 

m-k 


It is easy to check that {M^ :0^k^m-l} is a martingale with respect to the filtration 
: 0 ^ A: ^ m - 1}. Then I l4.17| l and Doob’s L^-maximal inequality yield 


E(^i2(m))^ 


2m 

(Tip) 




Using ( |4.16t with i - m/2 then gives E(^i 2 (m)) ^ 16 for all m ^ 1. Thus we have shown that 
sup^^jmax(E(.^ii(m)),E(^i 2 (m))) < oo. This proves l |4.14| and thus l |4.15| l. 

To complete the proof of the lemma, we need to get a tail bound on 5?2(w) appearing in 
( 14.131 1. As before, using 1^, it is enough to show sup^^^ < oo. However, note that 


/^z^rn) - max 

iE[m] 




^ ^i(m) + 


Pi 


cr(p) cr(p) 

We now use | |4.14t together with Assumption|4.4|to complete the proof. 


4.4. Another construction of (p. (k) and a modification. In this section, we start by giving 
a more explicit description of the algorithm described in Proposition |4^ via adding permit¬ 
ted edges to a tilted p-tree. We first set up some notation. As a matter of convention, we 
will view ordered rooted trees via their planar embedding, using the associated ordering to 
determine the relative locations of siblings of an individual. We think of the left most sibling 
as the “oldest”. Further, in a depth-first exploration, we explore the tree from left to right. 
Now given a planar rooted tree t e ¥„, let p denote the root and for every vertex g [m], let 
[p, v] denote the path connecting p to i' in the tree. Given this path and a vertex i e [p, v], 
write [p, v]) for the set of all children of i which fall to the right of [p, v]. Thus in the 

depth-first exploration of the tree, when we get to u, 

^( 17 , 1 ) := [P, V]) 


denotes the set of endpoints of all permitted edges emanating from v. Define 

em^HAp, V])}. 

i£[p,v]je[m] 


(4.18) 
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The function Am{-) defined in | |4.6| l is intimately connected to More precisely, let 

(i'(l), 1^(2),..., v{m)) denote the order in the depth-first exploration of the tree. Let y* (0) = 0 
and y* {i) = y* {i - 1) + PvU)- Define 

for ue{y*{i-l),y*{i)], and A(„,(0 := (4.19) 


Then the function associated with an ordered p-tree has the same distribution as the 
function Ami-) associated with the tree y/pOQ, where X = (Xy : u e [m]) are i.i.d. random 
variables uniformly distributed on (0,1). 

Finally, define the function 

A.[m) (t):=a Y. Pu^im] iv). (4.20) 

ve[m] 


While all of these o bject s depend on the tree t, we suppress this dependence to ease nota¬ 


tion. Now Proposition 4.3 implies we can construct ^mip, (A via the following five steps: 


(i) Tilted p-tree: Generate a tilted ordered p-tree S'm* with distribution l |4-5h Now con¬ 
sider the (random) objects %iiv,^m*) for v e [m] and the corresponding (random) 
functions 0(„)(•) on [m] and A(„)(-) on [0,1]. 

(ii) Poisson number of possible surplus edges: Let denote a rate one Poisson process 
on and define 


(s)}. 


(4.21) 


Write n := {isj, tj) :l^j^ iV*,} where iV*, = | n &>\. 

We will now use the set {isj, tj) : 1 ^ j ^ ^*m)} to generate pairs of points 
{i^j,Sij ): 1 ^ 7 ^ ^(m)} ill tho ti®® wiH b® joined to form the surplus edges. 

(iii) “First” endpoints: Fix j and suppose sj e (y* ii - 1), y* (i)] for some i ^ 1, where y* ii) is 
as given right above 1 14.191 1. Then the first endpoint of the surplus edge corresponding 
to iSj, tj) is S^j uii). 

(iv) “Second” endpoints: Note that in the interval (y* (z - l),y* (z)], the function is of 
constant height a&i^fuii)). We will view this height as being partitioned into sub- 
intervals of length apu for each u e ^ivii),tTm*), the collection of endpoints of per¬ 
mitted edges emanating from (Assume that this partitioning is done according to 
some preassigned rule, e.g., using the order of the vertices in ^( zz (z), tTm*) ■) Suppose tj 
belongs to the interval corresponding to u. Then the second endpoint is S^j - u. Form 
an edge between iSffj, t%j). 

(v) In this construction, it is possible that one created more than one surplus edge between 
two vertices. Remove any multiple surplus edges. 

Lemma 4. 10. The above construction gives a random graph with distribution ^mip, d) as in 
Deflnition \A.2\ Further, conditional on STm* ■ 

(a) , has Poisson distribution with mean A(„,) iSTm*) where is as in ( |4.20t . 

(b) Conditional on 3'm* and Ai^*, = k, the first endpoints (ifj-: 1 ^ j ^ k) can be generated in 
an i.i.d. fashion by sampling from the vertex set [m] with probability distribution 


^""’(zz) oc Py0(„)(zz), zzejznj. 

(c) Conditional on STm*, Ai^*, = k and the first endpoints {5£j : 1 ^ j ^ k), the second end¬ 
points can be generated in an i.i.d. fashion where the probability thatS^j - u is propor¬ 
tional to pu ifu is a right child of some individual y e [p, S£j]. 
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Proof: The assertions follow from Proposition |4.3| and standard properties of Poisson pro¬ 
cesses. ■ 


The modified space a): We construct a modified graph a) as follows: 

(i') Generate a tilted ordered p-tree with distribution ( |4.5t . 

(ii') Conditional on generate iV^*, ~ Poi(A(„)(.^^’*)). 

(iii') Conditional on and iV^*, = k, generate the first endpoints : 1 ^ 7 ^ A:) in an 
i.i.d. fashion hy sampling from the vertex set [m] with prohahility distribution 


pv<&(,n)iv), ve[m\. 

(iv') Conditional on 3'^^, iV(*, = k and the first endpoints {5£j : 1 ^ j ^ k), generate the 
second endpoints in an i.i.d. fashion where conditional on - v, the probability dis¬ 
tribution of is given by 


Q'rw 


l.uPu'^{ue3t^{y,[p,v])] l(&^rn){v) if ye [p,u], 
0 otherwise. 


(4.22) 


Identify and Mj for 1 ^ j ^ k. 

Thus, instead of adding an edge between and one of the right children on the path 
[p,S^j] as in Lemma |4.10rc), we identify it to the parent of this vertex which is on [p,S^j]. 
Also, we do not remove any multiple surplus edges. This construction turns out to be easier 
to work with. a) will be viewed as a metric measure space via the graph distance 

where vertex v has mass X Pu where the sum is taken over all u e [m] which have been iden¬ 
tified with V. Intuitively it is clear that cr(p)#m(p> ci) and a) are “close”. This is 

formalized in Lemma l4.12[ 


Remark 6 . At this point we urge the reader to go back to Section [2.3.1 [ and remind them¬ 
selves of the four steps in the construction of the limit metric space ^ooiO,Y), and note the 
similarities to the construction above. In particular, we make note of the following: 


(a) For finite m, we essentially tilt the p-tree distribution via the functional - 

exp(aE[0(„) (Vi) | 3'^]) (the term 1{3'^) as in ( |4.8t can be ignored as we will see in Lemma 

I, and the number of shortcut points selected, namely AI(*,, has a Poisson distribu¬ 
tion with mean aE(0(„)(Vi) | ST^*). Here Vi has distribution p. 

(b) For the limit object, we tilt the measure using the functional - 

exp(7E[0(oo)(Fi) I U]), and the number of shortcuts, namely follows a Poisson 
distribution with mean 7 E( 0 („„)(Vi) | U*). Here Vi is distributed according to the 

mass measure p* on 3[^*. 



As a brief warm-up to the kind of calculations in the next section, we now prove a simple 
lemma on tightness of the number of surplus edges. We will prove distributional conver¬ 
gence of this object in the next section. 


Lemma4.11. Under Assumption\'^A\ the sequence 
below | |4-211 l . 


: m ^ 1 } is tight, where is as given 


Proof: Fix r > 1. First note that conditional on 3',^’* = t, AI(*, has a Poisson distribution with 
mean A(„,(t). Thus, there exists a constant C-C{r) such that 


E([iV*,]''|5-P'* = t)^C[A,„,(t)]''. 
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Further, note that the tilt L(t) in jO) satisfies 


L(t) := I(t)exp 


^ apkPe = I(t)exp(A(„)(t)), 
V(fc,f)e*P(t) j 


where 1 ^ I(t) ^ C' for a fixed constant C' independent of m by ( |4.91 l. Thus, Proposition |4.8| 
shows that 


sup E(exp(7A(„,(5'P))) < oo 

m>l 


for any 7 > 0. In particular. 


sup EiliV*/) ^ sup CEflA^^d-^m *)]'■) = Csup 

m&l m^l ^ ' m^l 


E(I(5-P)) 


< 00, 


which proves tightness of {Ai,*, ■ 

We conclude this section by proving a lemma which essentially says that it is enough to 
work with the modified space a). 

Lemma 4.12. Recall the five-step construction of^miV’ Construct a) on the same 

space by coupling it with o) in the obvious way. Then, under Assumptior ^^ 

<^GHP a), a) j 0. 


Proof: Define the event 


F {Ai(*, equals the number of surplus edges in ‘Smip, d)}. 


In other words, F describes the event in which does not have multiple surplus 

edges. It is easy to check that 


<^GHP [^m (P> a) , (p, a) j 


^ Ai(*, on the set F 


Thus, Lemma 4.11 combined with the assumption cr(p) ^ 0 yields the result provided we 
show that P(F^) ^ 0. To this end, note that 


P ^3 multiple surplus edges between u and - tj 

= P {Po\iapuPv) ^ 2) ^ ciapuPvf 


for every ue[m], ve^iu,t), and some universal positive constant c. Hence 
p(F^|5'P’* = t)^caV(p)2 Y. pI Y. {Pv/crip)Y 

uelm] veSpiUyt] 

^ ciaa{p]f Y ciaa{p)fa{.pf. 

ue[m] 

Since (7(p) ^ 0 and acr(p) ^ 7, P(F^) ^ 0 as desired. ■ 
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4.5. Completing the proof ofTheorem |4.5t At this point we urge the reader to remind them¬ 
selves of (a) the four steps in the construction of the limit object in Section [Z3| (b) the birth¬ 
day construction of p-trees at the end of Section [4.2.1| and (c) the definition of Gromov-weak 
topology in Section [2.1.2| of complete separable measured metric spaces . Fix ( and a 
bounded continuous function 0: IRf ^ K. Let O be as in | |2.31 l. To simplify notation, we will 
write <1>(A) instead of <1>(A, d, p). To prove Theorem |4.5[ we need to show that for every fixed 
\ and functions <p and O as above, 

E[<l>(cr(p) •#TO(p,a))] ^ E[O(‘J^oo(0,r))] as m^oo, 

where we sample £ points according to p in d) while we sample £ points according to 
the measure on inherited from the mass measure. Now recall the explicit five step 

construction of d) in Section [Zi] starting from the tilted p-tree 3'^’* and the Poisson 
number of surplus edges N*. Fix I and note that 


(m)' 

K 


E[0(o-(p)#„(p,a))] - ^ E[<l)((7(p)#^(p,a))]l{Ai*, = A:}] 

k=Q 


\moonN*^K+i). 


Using Lemma 4.11 we can choose K large (inde pend ent of m) to make the bound on the 

we can work with a) instead 


4.12 


right arbitrarily small. Further, in view of Lemma ■ 
of (p> d ). Hence it suffices to prove the following convergence for every fixed A: ^ 0: 


E 


O ((7(p)^r‘^(P-«)) = A:}] - E [O (^oo(0,r)) = k}] as m - oo. (4.23) 

To analyze this term, we first need to setup some notation. 

Note that both the finite m and the limit object are obtained by starting with a discrete 
tree for finite m and a real tree in the limit, and sampling a random number of pairs to create 


“shortcuts”. Recall the space T*^ in Section 


2.1.3 


Fix A: ^ 0 and let t be an element in T* 


for some / ^ 0. will not play a role in the definition below. Write p for the root and denote 
the leaves by 

^k,k+( ■— (-^li ^2y • • • I ^k’ ^k+ly •■•y ^k+t)- 


Also recall that for each i, there is a probability measure Vtj (•) on the path [p, x;] for 1 ^ i ^ 
k + £. For i^k, sample y/ according to the distribution Vtj (•) independently for different 
i and connect Xi and y/. Let t' denote the (random) tree thus obtained and let de denote the 


graph distance on t'. Define the function gV : T 


(p ■ i,ik+e) 


:by 


-i ^[<p[dt’iXi,Xj):k+l^i^k + £)],iit9^d, 

1 0, ift = d. 


(4.24) 


In words, we look at the expectation of (p applied to the pairwise distances between the last 
£ leaves after sampling y, on the path [p, xd for 1 ^ i ^ A: and connecting x/ and y,. Note that 
here the expectation is only taken over the choices of y, . 

Next, given t e and v {ui, ...,Vr) with Vi e [m], set t(i;) to be the subtree of t span¬ 
ning the vertices v and the root provided ui,...,Ur are all distinct and none of them is an 
ancestor of another vertex in v. When this condition fails, set 1 ( 1 /) = d. 

Now, conditional on construct a tree 3'^* where 


(il y'™’ 

^k,k+e 


•“'■'^1 . '^k ’'^k+V-'^k+t^’ 


(ii) W'" , 1 ^ i ^ A: are i.i.d. with the distribution (•) as in Lemma 4.10 fi); and 
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(iii) Vll\,... are i.i.d. with distribution p. Further, .are jointly 

independent. 

We will drop the superscript and simply write Vj, F/ etc. when there is no scope of confu¬ 
sion. Note that (V k.k+f) - d whenever Fi,..., F^. 14+i> • • • 14+/ are not all distinct or one 
of them is an ancestor of another vertex in Vyt,fc+/. In either of these two case, the subtree 
spanned by the root and Vk.k+e will have less than k + £ leaves. We made the convention of 
setting = d to make sure that we are always working with a bona fide element 

in T* However, this makes no difference at all since by Corollary 15], 

limP(5-P(Fi. Ffc+^)=d)=0 

m ^ ^ 


where Fi,..., Vk+e are i.i.d. p random variables. Now 3'^* is obtai ned by tilting the distribu¬ 
tion of 3'^, where the tilt L(-) is uniformly integrable (Proposition |4.8| . Further, Vj, I ^ i ^ k 
are i.i.d. with the distribution oc ^^0,„,(/') where maxy0(„)(r') is stochastically dom¬ 

inated by IIF^^^^’Pjjoo (see l |4.12t and the discussion below | |4.19] l). It thus follows that 

lim P fj-P’* ^k.k+e) = d] = 0. (4.25) 

mV J 

Using ( |4.25] l, we see that 


<l)((T(p)^^““‘^(p,a)) ]l{iV*, = k} 


(4.26) 


e{Ep.+ [g;"’ ((T(p) 5 -P'Vfc,fc+/))] ll{iv*, = k}} + 0(1), 


where Ep,*(-) := E(-|.^^’*). At this point, we also define Ep(-) := Ej-j^j^) where 3'^ has the 
original ordered p-tree distribution I l2.5t . 

Now since ^‘'"’(o) oc we see that the inner expectation in | |4.26i l can be simpli¬ 

fied as 


E 


p,+ 


gJ^(o-(p)5-P’Vfc,fc+/)) 


E. 


P,* 




(4.27) 


[Ep,*(0„„,(^i)]^ 

where Vk,k+f - (^n -14+/)> and F/ are i.i.d. with distribution p. Since 3'^'* is sampled 

according to a tilted p-tree distribution, combining | |4.26] l, and l |4.27| l, we get the following 
result: 


Lemma 4.13. Fix k^O. Then 


E 


o(cr(p)<:^r^P-«))fl{<, = A:} 


(4.28) 


= CmE 


^p [{nU ^UVi)) 4' (o-(p)^^(Vfc,A:+/)) 


[Ep(0(„,(Fi)]fc 


L(5-P)]l{iV,„, = k} 


+ 0 ( 1 ), 


whereCm-[^{£{3'^))] \ and L is the tilt as in i4.4| . Further, conditional on 3^^, hasa 
Poisson distribution with mean A(„)(.!JjP) = aEp(0(„)(F)) as in | |4.20h where V has distribu¬ 
tion p independent of 3'^. 


This formula will be the starting point to prove l |4-23t . Recall from | |4.8| l that the tilt 
L(-) = I(-)F(-), where !(•) has a messy form given by l |4.91 l. We have already seen in | |4.10| | 
that under Assumption|4.4[ !(•) ^ C for a constant C all m 3^ 1. The following lemma coupled 
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with dominated convergence theorem will now imply that we can replace L with L in Lemma 
|4.13|and in all the subsequent analysis helow: 


Lemma 4.14. Under Assumption 


4.4 


l{Sr^) ^ 1 a5 m ^ oo. 


Proof: By | |4.9| | we have 1 ^ ^ expiaZ^^ i)eE[ 3 ''^) PkPi)- Thus it is enough to show that 

DeEiS'^) PkPi) 0- Now for k^le [m], write {A:'^ /} for the event in which I is a child 
of k in 5'^. Then standard properties of p-trees Section 6.2] implies that for k^li^l 2 ^ 

[m\ 


V{k^li)-p]^, P(A:'^ Zi and A: k) = pI- 


(4.29) 


Thus 


aE 


E PkPl 

\ik,l]eE[3-^) 


m 


m 




aY^PkY. PiPk a[(T(p)]^ ^ 0, 

fc=l Uk fc=l 


as m ^ oo hy Assumption 4.4 


Write Eg for expectation conditional on and the random variables 17“ that encode the 

order on 3'^y i.e., 


Ee(0:=E(-|.7;£,, u], 

and note that E[O(‘;^oo( 0>T)) 11{ZV(*, = A:}] has an expression similar to l |4.28t . Indeed, from 


the construction of ^ooi0,T"> given in Section |2^.3.1[ it follows that 

A:}] 


E[cD(<^oo(0,r))il{iV, 


(oo) 


(4.30) 


= CnnE 


Ee 




[E0(0,..,(i^r’)] 




0(oo)(-) is as defined in (b) 7(^,('^£)>is as in (c) Coo = 

1 rA\ random variables sampled from using the mass 

= (f) is the tree spanned by the root of 


where (a) 

[EL,^,(5;»,f/)]-\ (d) 
measure p, (e) 

,7^®, and viewed as an element of T, 


o,k+e 


and the root-to-leaf measures to be QlT’ (•) as in 

''i 

has a Poisson distribution with mean 


by declaring the leaf values to be 0(oo)(T3°°) 
, and (g) conditional on (^£,, U), 


''(oo) 


-^(oo) • 


:= r [ , euy)pm = e^ [0,^,(v'“>)] . 


Finally, observe that 7(„)(-) = I(,„,(-)7(,„,(-) where 7(,„,(t) = exp(aEp[0(„j(lA™O]), and recall 
that acr(p) ^ y (Assumption 4^ and is uniformly integrable (Proposition 4.81. 


Therefore, combining Lemma 
yields 1 14.231 1 and thus completes the proof of Theorem|4^ 


4.141 Lemma [4. 131 and i4.301 l with Theorem 4.15 stated below 
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Theorem 4.15. For each k^O, 


crip) 


y Ep 


uU f^(p) , 


4‘’(a(p)5-P(v5^,)) 


(4.31) 




k 

E 

\i=l 




The proof of this theorem is accomplished via the following two theorems for which we 
need to set up some notation. Fix 1^0 and / ^ 1. We will assume that 3'^ has been 


constructed via the birthday construction (see Section [4.2.If . This construction gives rise 
to an unordered p-tree. To obtain an ordered p-tree from this, let Q>^m)ii) denote the set 
of children of i in the p-tree for every vertex i. Generate i.i.d. uniform random variables 
U^m)ii) ■- {U^m).ii^) ■ ^ ^ ®(m)(0}, Independent across v e 3'^. Think of these as “ages” of the 
children and arrange the children from left to right in decreasing order of their ages. We can 
construct the function 0(m)(-) as in ( |4.181 l once this ordering has been defined. 

Now recall that the right hand side of l |4.7| l tells us how to sample / i.i.d. points 
(Vj''"’,..., Fj™’) from distribution p and the corresponding spanning subtree 3'j^ from the 
tree using the repeat time sequence : A: ^ l}. Thus, by the /th repeat time Rj, we would 
have sampled all / vertices F('”’ = Yr.-i. View 3'j^ as a tree with edge lengths and marked 
vertices as follows: (a) rescale every edge to have length cr(p); (b) relabel Vj as j+ and the 
root as 0-t; (c) mark only those vertices i ^ I which occur in 3'f^-, (d) for all 1 ^ j ^ J, set the 


leaf values to be 0(„)(Vy)/(7(p), and assign the measure := as defined in l |4.221 i to the 


path connecting the root to Vj, i.e, to the path [0-t-, j-t] . 


Definition 4.16. Fix / ^ 0, / ^ 1 and consider the tree constructed as above. Set = d 

if some j+ is not a leaf or if some leaf has been multiply labeled. Otherwise, write rj^ e T///or 
the tree with edge lengths and at most I labelled hubs, namely where we retain information in 
(a) and (b) above. Write g T^^ for the tree where we retain all information (a)-(d) above, 
namely the leaf values 0(m)(Vj) and the root-to-leaf probability measures Q^’(-) in addition 
to (a) and (b). 


Now recall the tree defined in Section 
dients in the proof of Theorem|4.15|are the fo 


2.2 


using the limit ICRT 3',^y The main ingre- 


Theorem4.17. Under Assumption 


4.4 


lowing two theorems: 

(oo) 


l^fj asm^ oofor every fixed 7^0 and J^l. 

This convergence is with respect to the topology defined on T*jj in Section\. 


2.1.3 


The second result we will need is as follows. Recall the function on T* as in 04.24 1. 


Theorem 4.18. Fix I ^0, k^O, £ ^2 and a bounded continuous function <p on Then the 
function g® is continuous on T* . 


Proof of Theorem |4. 1 5| Assuming Theorems 4.17 and 4.18 let us now show how this com¬ 
pletes th e pro of. Getting a handle directly on the conditional expectations as required in 
Theorem 4.15 is a little tricky. Naturally, conditional on 3'^, repeated sampling of vertices 


and calculating sample averages should give a good idea of the conditional expectations (and 
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the same for the limit object ^f,,) • This is made precise in the following simple lemma whose 
proof we leave to the reader. 

Lemma 4.19. Suppose with m e {1,2,...,} u {oo} is a sequence of Un¬ 

valued random variables such that for each fixed r ^ 1, there exist random variables Xf^ 
such that the following hold: 

(i) There exists a constant C <oo such that for any me {1,2 .} u {oo}, r ^ 1 and e>0, 

max >£) ^ 

5 = 1,2 '■ ^ e^r 

(ii) For each fixed r ^ 1, X'/"’ 

ThenX^"'^ ^X'“>. 


We will apply this lemma with the random variables that arise in Theorem 4.15 That is, 
we set 


X 


(m),l ._ 


E. 


crip) 




and similarly define and to be the second coordinates in the display ( |4.311 l. To 
define Xf\ we proceed as follows. For each fixed r ^ 1, sample a collection of Jr [r + (A: + 

£) r] points all i.i.d. p from 3'^ and think of them as r individuals points-(Vj*'"’, .K'™’)- 

. for l^i^r. Define 


and r (A: + f) dimensional vectors-V*'"’’' ■-jrm 


»r ('■):= n 


jl e. (v!f) 


i] 


t = l 


o-(p) 


for 


1 ^ A ^ r. 


For m = oo, sample as above Jr points using the mass measure p from and define 

»;”’(<) :=n®„(V,“)4‘'(^r*(V5L)). for isisr. 

7 = 1 


Now define 




rcr(p) 


for m e {1,2,...}, 




_ 




for m G {1,2,...} u {oo}. 


LetX'/"’ := for m e {1,2,...} u {oo}. To complete the proof of the theorem, we have 

to check the two conditions of Lemma |4.19l Let us check condition (i) of Lemma |4.19| for the 
first coordinate. The second coordinate can be handled in an identical fashion. 

Applying Chebyshev’s inequality conditional on and then taking expectations, we get 

p^lX^rnu _ xfrU] > ^ r) E(Varp (0 (^i)/(T (p))) =: say, 

where Varp defined analogously to Ep is the conditional variance operator. Obviously 


E 


Var. 


crip) 


^Var 


cr(p) 


^E 


o-(p) j 
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From the argument given below l |4.111 l, it follows that ||(S(m)||oo ^ 11I loo- Flence Lemma 
|4.9| implies that sup^ Q^) < oo. This verifies (i) of the lemma. 

Let us now verify condition (ii) of the lemma. Writing this out explicitly, we have to show 
for each fixed r ^ 1, 


(4.32) 



d 


[ raip) ’ r j 


{ r ■ r j 


To this end, for each m e {1,2,...} u {oo}, consider the subtree sp annin g the Jr points 

viewed as an element of T^^^ as in Definition 

we get 


4.17 


and continuity of the function from Theorem 


<P 


4.18 


4.16 


Using Theorem 


o-(p) 






with respect to weak convergence on IR^'", which in turn implies l |4.32t . This completes the 
verification of the conditions of Lemma [4.19| and thus the proof of Theorem |4.15[ ■ 

The rest of this section proves Theorems |4. 1 7| and 4.18[ 


Proof of Theorem 4.17| The proof will rely on a truncation argument that is qualitatively 
similar to Lemma [4. 19 Fix a truncation level i? ^ 1. Recall the definition of from 


( |4.18| l which kept track of the contribution of all right children of individuals i on the path 
[p, v]. We will look at a truncated version of this object where we keep track of the potential 
contributions of only the first R vertices. More precisely let 

E E P]Hj^S^^ii,lp,v])}. (4.33) 


0^ V 


i£[p,v]ie[m] 

i^R 


Let 0^)(-) be the analogous modification of 0(oo)(O defined in ( |2.7t , i.e. 


i^R 


Y, uf X 


Similarly modify the “second endpoint” measure in | |4.221 l to keep track of only ancestors 
with labels ^ R, namely 


Q‘r'^(y) := 


\ZuPu^{ue^^iy,[p,v])}/&f^^iv), 

10, 


if y G [p, u] and y ^ R, 
otherwise. 


Note that this does not make sense if - 0, i.e., when there is no vertex with label 

^ i? on the path from the root to v. In this case we follow the convention of defining the 
measure to be the uniform probability measure on the line [p, v]. Define Q'“’"”(0 on in 
an analogous fashion. 

and assign to leaf Vj the truncated measure 


Consider the tree as in Definition 


Q^’'^(-) and leaf value 0 


4.16 


(17;) (instead of Q^(-) and 0(,„)(U/)/cr(p)). We denote the resulting 


(m)' 


-v. 


object (which is an element of TC) by . Similarly construct ^ 


a{oo),R 


Proposition 4.20. The following hold: 
(a) For all R^l, 
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asR^oo. 

(c) For any bounded continuous function f: T 


(b) 


* 

11 


R^oo m^oo 


limsuplimsup 


'U- 


= 0 . 


Assuming this proposition, we now complete the proof of Theorem |4.17[ Note that for any 
fixed bounded continuous function / on and any truncation level i? ^ 1, we have 


+ |E(/(^'7''^))-E(/(^'7>))|. 

Now letting m^oo and then letting R^oo and using Proposition|4.20|completes the proof. 

■ 

We next prove Proposition|4.20[ 


4.6. Proof of Proposition 4.20[ We start with three preliminary lemmas. Recall that j} 


denotes the event that j is a child of i in 
Lemma 4.21. Under Assumption \AA\ for each fixed i ^ 1, 

Zje[m]PjHi-^i] Pi 


cr(p) 


o-(p) 


as m—^ oo. 


Proof: Recall from | |4.29| l that for fixed i, the collection of events {{i ^ j}: j f i] are pairwise 
independent and have the same probability pi. Thus 

.jElm]Pj^{i^ j}\ Pi ( ] Pi 

-Fp> — )=^[S'’r‘ 




and 


Var 


'Tjelm] Pj'^ii^j}' 


PI 


, y 

(Tip) * ^ 


-Var(]l{i j}) ^ Pi- 


je[m] 

This completes the proof as maxie[m] Pi - Pi ^ 0 and pi/aip) 6i under Assumption |4.4[ 

■ 

Lemma 4.22. Under Assumption \AA\ for each fixed i ^ 1, 

Pj P 


max 


0, asm^ oo. 


j-.i'-^j a{p) 

Proof: Fix e >0 and write 

Af(m) {J : pj ^a(p)£} and Ueim) = |^(m)|. 

Note that byAssumption|4.4[ for every e > 0, {nelm): m 3^ 1} is a bounded sequence. Further, 


( 14.291 1 and Markov’s inequality yield 


P/ 1 V- 

max —>£)^ y Pi ^ nE(m)pi ^0, 


J-.i'^j CT(p) 


as max/e[TO] pi = ^ 0. 
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Recall that S>ot( 0 is the set of children of verte x i in For later use let ^^(0 := l®m(i)l 
denote the degree of i in Note that Lemma [4.21 [ together with the lemma just proven 
gives 

p 




oo, 


as m—>- oo. 


(4.34) 


Lemma 4.23. For each fixed m, let q(m) {qi, q 2 ,... qd) be a probability mass function with 
qi >0 for all i, m where d - dim) oo as m ] oo. Assume further that q^iax max/ei^/j qt 0 
asm^oo. Let{U'f"^ i ^ d} be Ltd. uniform random variables and consider the function 

d 

Wm{t):^Y. diHuT' ^t]-t, t e [0,1]. 

i=\ 

P 

Then supfg[0,1] \'^mit)\ —»0 a5 m ^ oo. 


Proof: Recall the proof of Lemma 4.9 where we studied the tightness of the tilt. Then re¬ 
placing p in the proof hy q, the quantity of interest is sup^^jo \ Wmit) \ = (7(q)^i(m) where 

(m) is as defined in 


and (T(q) := y'Li q^- Now 1 14.14t and 1 14.15t imply the existence of 
a constant C (independent of m) such that for all m and e, 

P( sup \Wmit) \ > x0-(q)) ^ exp(-Cxlog(logx)). 

fe[0,l] 


Since aiq) ^ ^ 0 as m ^ oo, this completes the proof. ■ 

We now have all the ingredients for the proof of Proposition l4.2d| We prove parts (a), (h) 
and (c) one hy one. 

Proof of Proposition [4^^ ([^ : Recall from Definition 


4.16 


the tree r\y that contains all the 
edge lengths and huh information in hut ignores root-to-leaf measures and lead values 

0(„)(•). By Corollary 15] or Proposition 3], for fixed / ^ 1, we have 


I dfr} : ^ o) 


(4.35) 


with respect to the product topology on O/'^oT/'/. Using Lemma [42T| Lemma [4l^ and Sko- 
rohod embedding, we assume that we are working on a prohahility space that supports a 
sequence of unordered p-trees sampled vertices ju-'"’ :1^ m^l| us¬ 

ing the associated sequence of prohahility mass functions {p(m): m ^ l}, an ICRT and 
sampled vertices | : 1 ^ ^ /| using the mass measure such that the following hold: 

(A) Convergence in ( 14.351 1 happens almost surely: 




as m ^ oo 


(4.36) 


coordinatewise, where the underlying tree corresponding to is spanned hy the root 
of and U]'”’, 

(B) Writing Smii) ■- Lye® (/) Pv for the sum of weights of children of i in 3^^^°, we have 


Smif . T 1 a.s. 

- -. 1^1 

cr(p) 


(0C i>l) 


(4.37) 


coordinatewise. (We can assume that this holds because of Lemma l4.211 ) 
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(C) For fixed hub \ and m ^ 1, write 

Pv 






V e : 


lii), 




max 

m,i 


:= max qm.iiv). 

veSimli) 


(4.38) 


Then we assume (using Lemma 4.22 and ( |4.34t ) that for all i ^ 1 


q 




^Oand dm{i)^oo. 


Now, for each z g [m] and i ^ 1, if i e [p,z] (where p = is the root of write 

cii'yZ) G @to(z) for the child of i that is the ancestor of z. Next, construct a collection 
{Um,iii^): ^ l,v e [m]} of uniform[0,1] random variables on the same space such 

that 

(a) z ^ 1, izG [m]} are jointly independent for each m^l; and 

(b) for each z ^ R and j for which z g [p, Um,i (c(z; l/j'"’)j is a constant sequence (in 

m) eventually. 

As described below Theorem |4.15[ we can use these uniform random variables to gen¬ 
erate the sequence of ordered p-trees from as follows: Let Um,i ■- 

{Um,iii^) ■ V G Q>yn{i)]. Think of these as “ages” of the children and arrange the children from 
left to right in decreasing order of their ages. 

Once this ordering has been defined, we can construct the function 0(„)(•) as in ( |4-18 l. 
In this case we can write this function explicitly in terms of the associated uniform random 
variables as follows. Define 

C)(„),i(z) := ^ 3 ^] 1 {zg [p,z]} Y. <Um,i{c{v,z))] 


cr(p) 

11 {z G [p,z]} 




^ ^ q^4vn{Um,iM<U^,iiciv,z))}. 


Then 


(cr(p)) ^0f,„,(z) = ^ (D(„,,,(z). 


(4.39) 


i^R 


Similarly the root-to-leaf measure (recall l |4.331 ) can also be expressed in terms of this 
function. 

Now using | |4.361 l, for every fixed hub z ^ i?, j ^ /, and a.e. sample point o), one of the 
following two holds: 

(a) z ^ [p, in which case there exists m - mico) such that z ^ [p, V)'"’] for all m > mict)). 

(b) z G [p, 1/!°°’], in which case there exists m - mito) such that z g [p, I/)'"’] for all m > mio)). 
When the latter happens, using LemmaliT^together with | |4.37t and l |4.381 l, we get 


O,^uiVp-0iUm,i(cii-,vp] 


0 as zn ^ oo. 


By construction, Um,i (c(z; 1/!'"’) j is eventually constant in m on the event |z g [p, 1/!°°’] |. This 
immediately implies convergence of the (scaled) truncated leaf values 0f„)(V^j'"’)/cr(p) (see 
( |4.391 l) for 1^7^/, and similarly the truncated root to leaf measures Q‘7m)' jointly with the 


convergence in | |4.361 l and thus yields the convergence 5? 


im),R 

u 


^ioo),R 

'u • 






























40 


BHAMIDI, VAN DER HOFSTAD, AND SEN 


Proof of Proposition |4.20| (0: Recall from Section 2.2 that is obtained by applying the 

stick-breaking construction to [O.rjj], and leaf j+ in corresponds to the vertex coming 
from rjj. It is easy to see from the definition of and that it suffices to prove 


0 ^ £ i&UVj) - ^ 0 , asR^oo. 

i=i 

For every hub i ^ 1 and leaf rjj, write {i rjj} if r]j is a descendant of i (namely i e [p,rij]). 
Then note that 




/ oo oo / oo 

7 = 1/=R+1 fc=l 7 = li=i?+l 


Thus, it is enough to show that given e > 0, we can find R-Rie) < oo such that > e) < e. 
To this end, first choose JQ large enough so that Pirjj > iQ) < ell, and then choose R^ large 
enough so that 


JK, 


Y.e]<ei2. 




Then note that 


> e) ^ P(77/ > J<(e)-t P / ^ 0; 11 {ith hub appears before time JQ} > £ 

I ] 

e I °° 

^ 9 + “ E “ expi-d/Jiff)) 

^ ^ /=R,. + 1 


< £ 


by the choice of Re, 


where the first term in the second inequality follows from the choice of Ke, while the second 
term comes from the stick-breaking construction of using the countable collection of 
Poisson point processes. This completes the proof. ■ 


4.20 


Proof of Proposition 

made up of 2/ -t 1 coordinates: 


Recall that the tree 


(and can be thought of as being 


(a) 


(b) 

(c) 


One coordinate for the shape and edge 
than I namely rl'?’ (see Definition 




4.16 . 


ength information along with the labels smaller 


Note that this is the same for both and 


^^ 1 } . 

J coordinates for the leaf values 0(„)(V/)/cr(p) (resp. 0f„,(Vj)/cr(p)). 

/ coordinates for the measured metric spaces ([p, Vd™’], Qj/'"’) (resp. := 

([p,Fp, Qj,"”''')). 


Since T*^ assumes the product topology on these coordinates, it is enough to show the re¬ 
quired estimate in Proposition |4.2Q| @ with functions of the form 

/(t,(adi^7^/,(My)i^^-^/):=F(t) 0 SMi) 0 hjiMj). 


Here teTjj, uj eU are associated leaf values and Mj are the paths from the root to leaf j 
with an associated probability measure and /, gj and hj are bounded uniformly continuous 
functions on the spaces T//, P and y' (measured compact metric spaces) respectively. To 
simplify notation, we will simply write this as /(t). 
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Now we can go from to by flipping one coordinate at a time. Thus writing 

//-'’(t):=F(t) n SMi) n hjiMj), 0 gjiad fl hjiMj), 

we get 





Since Vj’s have been sampled in an i.i.d. fashion from p, it is enough to show that for any 
two bounded uniformly continuous functions /z, g on IR and y' respectively, 


limsuplimsup E 

R^oo m^oo 


, or(p) j O'(P) j j 


= 0 , 


(4.40) 


and 

limsuplimsup E(|fz(.y^^'”’) - fz(.y^^'”’'^)|) = 0. (4.41) 

R^oo m^oo 


Now consider the measured metric spaces and As remarked above, they share 

the same metric space, namely the path [p, The only difference is in the associated 
probability measures. Consider the natural correspondence C = {(x,x) :xg [p, Vj'""’]} be¬ 
tween and Further, define a probability measure n on [p, x [p, iC""’] as 


\LuPu^{u e ^^(z, [p, V^''">])}/0,„,(y;™’), if i e ip, and i ^ R, 

Writing iii and ti 2 for the marginals of n, we have, using the above choice of correspon¬ 
dence C and of the measure n, 


^ (ii^i - QiT’ii+11^2 - Qr"ii) 


2[0 




(4.42) 


Now suppose we show ( |4-40 l. 

-IrK r\r{m), d 


&miyi) 

Using part ijil and part 0 of Proposition |4.20[ we get 


(cr(p)) ^0(,„)(V'j‘'”’) —► 0(^)(V]‘°°’) >0. Now using the bound in | [02] l and uniform continuity 


of h, we see that | |4.411l is tru e. Hence it is enough to prove | |4.40h 

Recall from Section 4.2.1 the construction of and the tree simultaneously via the birth¬ 

day construction, where is obtained as the value before the first repeat time, namely 
Yrj-i. Fix £ > 0. By ^7^ Theorem 4], under Assumptions |4.4| we may choose JQ large so that 
the first repeat time satisfies P(i?i > JQ/crip)) < e for all m ^ 1. Next, by uniform continuity 
of g, choose 6 e (0,1) such that |g(x) - g(y) | < £ if |x - y| <6. Finally choose R large so that 
for all m. 




L 


Pi 




< £. 
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First, by choice of Kg and boundedness of g, 


g 


cr(p) 


-E 


g 


cr(p) 




o-(p) 


llglloof, 


(4.43) 


and a similar inequality holds true if we replace the functional 0(„, by (Sf^y Next, writing 





o-(p) j cr(p) 


MRi 


Ke 

(Tip) 


we have 


C(^)^£ + 2|lglloo 




Ke 

crip) ’ 


{(3uvr)-^Livr'^) 

crip) 



(4.44) 


by our choice of 5. The difference 0(m)(^i™’) “ is a tricky object for which we will 

need a tractable upper bound. Recall that we have used for the birthday tree in | |4.7| 
constructed by time Ri. For every vertex i e 3'^, let ^(i) be the first child of i in the birthday 
construction (the first new, i.e., previously un-sampled vertex sampled immediately after a 
prior sampling of i). This will be an empty set if i is a leaf in the eventual full tree 3'^. Recall 
that {i'^ j} was used to denote the event that j is a child of i in 3'^. Then note that 


i^[m] 


Thus, 


i^R+l 


[^avn-(5uvn) 

^ " cr(p) ’ cr(p) 


cr(p) 

I m 

I I I 

^ i=R+ljelm] 


Pj 

cr(p) 


f 


Kr 


i appears before-, i j, j ^ 0ii) 

crip) 


--iSZiR). (4.45) 


For i ^ j e [m], define the event Eij |i appears before i ^ j,j ^(i)|. Then for Eij 
to happen, the following needs to happen in the birthday construction: (a) There is an 0 ^ 
ri ^ Ke la ip) such that till time ri, neither i or j have been sampled, (b) At time ri + 1 vertex 
i is sampled, (c) There is an r2 ^ 0 such that in the times [ri + 1, ri + 1 + r2] samples, j does 
not appear, (d) Then at time ri + r2 + 2, vertex i is sampled again, (e) In the next time step 
ri + r2 + 3 vertex j is sampled. Therefore, 


Kelcrip) oo 

PiEij)^ E L PiPiP j ^ 

ri=0 r2=0 


2 Ke 


Using this in l |4.45| l, we get 


1^ tn 


^ e, by our choice of R. 


(4.46) 


Combining ( |4.43t , | |4.44| l, ( |4.45| l and l |4.461 l now gives the following lemma which completes 
the proof of l |4.401 l and thus the proof of part @ of the proposition. ■ 
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Lemma 4.24. Given e > 0 choose iQ, 6 and R as above. Then, for all m^\, 




[ o-(p) 


-E 


g 


crip) 


e(4||g||oo + l)- 


Proof of Theorem 


4.18 


We now prove continuity of the function on the space T* 


i,ik+e]- 


In fact, we will give a quantitative estimate. Since we are assuming the discrete topology 
on the coordinate corresponding to the shape, without loss of generality we will work with 
two trees t,t g T* having the same shape. We need to distinguish the labels for the root 

and the leaves in the two trees; so write 0 + (respectively 0 +) for the root of t (respectively 
t) and write {j+:l^ j ^k + i] (respectively + : 1 ^ ^ A: + z|) for the collection of leaves 


in t (respectively t). Finally, let vj be the corresponding probability measure on the path 
.J{j [ 0 +, 7 +] for 1 ^ j ^ k, and analogously let vj be the probability measure on 
[ 0 +, 7 +]. View these paths as pointed measured metric spaces pointed at the roots 0+ and 
0+ respectively. Now let Cj where pointed Gromov-Hausdorff- 

Prokhorov metric defined in Section lzTTl 

Write L- ( 2 ). Let <p :U^ ^ R be a bounded continuous function. For K > 0, let OiK) - 
[ 0 , K]^, and for d > 0 , define 


osc 0 (d,Jr):= sup \(pix) - (piy)\. 

x.yeniK] 

l|x-y||oo<5 


Finally, define 


k 

£ := 4 ^ ey + (k + 1) ^ I leit) - fit) 

7 = 1 e 


(4.47) 


where fi-) denotes the length of the edge e and we have used the fact that both trees have the 
same shape. Write ht(t) for the height of tree t (not graph distance, rather in terms of max¬ 
imal distance from the root when incorporating edge lengths). The following proposition 
completes the proof ofTheorem |4.18| 


Proposition 4.25. For two trees t, t e T* 


having the same shape, and with e as in l |4-47K 


IS'J’(t) - gj’(t)l ^ 2£||0 ||oo + osc^ (e, 2ht(t) +2ht(t)]. 


Proof: For each j ^ k, choose a correspondence Cj and a measure ttj on the product space 
[ 0 +, 7 +] X [ 0 +, 7 +] such that the following conditions are met: (a) ( 0 +, 0 +) g Cj] (b) the 
distortion satisfies dis(Cy) < Ssj-, (c) the measure of the complement satisfies rijiCp < 2ef, 
(d) and finally 

||vy-p*7r^|| + ||v;-p^:T:y|| <2£;, (4.48) 


where p^,nj and are the marginals of rij. Now sample iXj,Xj) ~ ttj from [0+,_/+] x 
[ 0 +, 7 +] independently for 1 ^ j ^ k. By | |4.481 l, we can couple iXj,X*) with two random 


variables Xj,Xj (again independently for I ^ j ^ k) such that Xj ~ vj and Xj ~ vj, and 
further 


p(Xj^X*] + p(Xj^X*)<2ej. 


( 4 . 49 ) 
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Using conditions 10 and we get 

p( dt(o+,X*)-dt(o+,X*) >3£y)^2e;, (4.50) 

where dt is the distance metric on tree t which incorporates the edge lengths. Now write E 
for the following “good event”: 

k 

£:= = = dt(o+,X;)-dt(o+,X*) ^3£y}. 

7 = 1 

It follows from l |4.49| | and l |4-501 l that 


P(£")^4X£;. 
7 = 1 


(4.51) 


Now we are going to create “shortcuts" hy gluing the leaves to the corresponding sampled 
points. Let S (resp. S) he the (random) metric space obtained hy identifying each of the 
leaves 7 + (resp. 7 +) with Xj (resp. Xj) in t (resp. t) for 1 ^ j ^ k and write ds (resp. d^) for 
the induced metric. Then hy definition, 


<(t) = E 


(/)(ds((A:+ i'l) 


+, {k + 12 )+): 1 ^ i'l < iz ^ 



and an analogous expression holds for (t). This gives 

I W - g^*"’ (t) I ^ E ( (p[ds iik+h)+, ik+ iz) +) :l ^ h < £] 

— (p |(^ + ii)+> (.k + 12 )4j: 1 ^ < ^2 ^ ^ 


(4.52) 


Consider the map from t to t which takes every vertex to the corresponding vertex and 
points on each edge are mapped hy linear interpolation (using the edge lengths) to points 
on the corresponding edge. Consider a e [0,j+] and let d e [ 0 , 7 +] he the corresponding 
point in t for some 7 ^ k. Then note that 


di{^(i,Xj^ — d^^u,X ^ dt^0+,^^7) ~ "^7j 

^3£7 + ^|/e(t)-Ze(t)| 


+ 


dt( 0 +,a) -d(( 0 +,a) 


(4.53) 


on the set E. 

Now consider a shortest path in S connecting {k + ii)+ and ik+ ( 2 )+. We can go from 
(A: + /i)+ to (A:+ 12 )+ hy taking the same route in S, i.e., hy traversing the same edges and 
taking the same shortcuts in the same order. We make the following observations: (i) The 
difference between distance traversed while crossing the edge e is | /e(t) - led) |. (ii) By ( |4-53| , 
on the set E, taking a “shortcut" contributes at most {3ej + Y.e I hd) - hd) I) to the difference 
between distance traversed. Since we have to take at most k shortcuts, we immediately get 


d^i{k + Zi)+, (A: + 12 )+] ^ ds ((A: + Zi)+, (A: + 12 )+) + 3 ^ £7 + (A: + 1) ^ |Ze(t) - /e(t)| 

7=1 e 


on the set E. By symmetry, a similar inequality holds if we interchange the roles of S and S. 
This observation combined with and yields the result. ■ 
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5. Proofs: Convergence in Gromov-weak topology 


Recall from Proposition |4. 1 1 that conditional on the partition of the vertices i ^ l} 
into the connected components, the actual structure of the components of ^^(x, t) can 
he generated independently as the connected graph where are as in 

Proposition |4.l| and given m,p, a, is the connected random graph model studied in 

the previous section. For Theorem |1.81 the time scale f = of interest in the expression of 
Un is 


A + 


1 

cr2(x<">) ’ 


for fixed A e K. Let ^([R+) denote the space of counting measures on U+ equipped with 
the vague topology. Define Y“ := (py/crip), v e 7'“’) and view (a^’c7(p“), Y“) as a random 
element of S := [R+ x ,yL(IR+) (equipped with the product topology). Finally, define 

5^„:=((aXp<‘>),Y®):i>l) 


viewed as an element of S°°, again equipped with the product topology induced hy a single 
coordinate S. Now given an infinite vector c e /q recall the process as in I jl.lGt , the cor¬ 
responding excursions as in | |1.171 l and the corresponding excursion lengths in ( |1.18 l. 
Finally recall the definitions of from l |2.10| l. Writing these out explicitly, define 


J>1) = 


ZiU)! E 


veZiiX] 




X veZiiX) 


: jeZiU)]: i ^ 1 




Propositions.!. The following hold under Assumption\l.6\ 


(i) For every i ^ 1, cr(p‘^') 
d 


(ii) 


on S°° as n 


0 as n—” oo. 

■ oo. Further for every fixed i ^ 1, almost surely, 
E Cy = oo. 


(5.1) 


veZdX] 


Proof of Theorem ] 1.8| We prove the theorem assuming Proposition |5.1[ By an application of 
Skorohod embedding we may assume that we are working on a prohahility space where the 
con verg ence in Proposition |5. l| happens almost surely. In particular, in this space, Assump- 
is satisfied almost surely for p‘^’ for any fixed i ^ 1. Now an application of Theorem 


tion 


4.4 


|4.5| completes the proof. 


5.1. Verification of weight assumptions in maximal components. Here we give the proof of 
Proposition |5.1[ To ease notation, we will throughout assume A = 0. The general case follows 
in an identical fashion, hut this assumption simplifies notation. We will write instead of 
for the process in l |1.151 l with A = 0 and simply write for ([ 0 ' 2 (x‘"’)]”^). 

We start hy describing an exploration scheme (developed in |[^) which simultaneously 
constructs the graph ^^„(x, t) and a “breadth first” walk. This was carefully analyzed in 
to prove Theorem |1.7[ 

For every ordered pair iu,v), \etriu,v be an exponential random variable with rate tXv (in¬ 
dependent across ordered pairs). Note that there is a simple relation between the connection 
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probabilities of t) given by l |1.10| l and the above random variables given by: 

^UV ^ Xjj). (5.2) 

At each stage i ^ 1, we have a collection of active vertices a collection of explored 
vertices 0ii) and a collection of unexplored vertices '%'(/) = [n] \ u (9 (i). 

Initialize with 0(1) - 0 and s^{l) - {r'(l)}, where the first vertex i'(l) is chosen by size- 
biased sampling, namely with probability proportional to vertex weights x. When possible 
we will suppress dependence on n to ease notation. Now let Qiivil)) {v.rjumjj ^ x^d)} 
denote the collection of “children” of uil) and note that by ( |5.2t this generates the right 
connection probabilities in t). Think of the associated rj vii],v values (for vertices con¬ 
nected to i'(l)) as “birth-times” of these connections in the interval [O.x^d)] and label the 
corresponding vertices as u{2), u{3),... v{,\3>(u(l))\ + 1). Update the process as 0(2) {i'(l)}, 
s^i.2) := @(i/(l)) and ^(2) = '^(1) \ @(i^(l)). 

Associate with this construction a breadth-first walk as follows: 


Z„(0):=0, Zniu):^ -u + Y.XvT^iVvm.v^u}, O^u^x^d). 

V 

Recursively for i ^ 2 let Ti-i := At this “time” we will explore the unexplored 

neighbors of v{i). By this time, there are |^(z)| := z - 1 -t \s^{i)\ vertices that have ei¬ 
ther been explored or are active. Let 3){v{i)) {ve ^{i) : ^ Xt,(d} and again label 

these as v{i -t |.a/(z)|), v{i + \s^{i) \ -t 1),... v{i -t |.c/(z)| -t |@(z;(z))| - 1) in increasing order of 
their riv{i)v values. Update 0[i -t 1) = 0ii) u {zz(z)}, -t 1) = s^(i) u S>(zz(z)) \ [vii)} and 
(z -t 1) = '^(z) \ ® (zz(z)). Again update the walk as 

Z{Ti-i + u)^ Z{Ti-i)-u+ Y. Xi;ll{rit,[i)^v^u}, 0^u^Xv[i). 

ve&ivii]] 

After finishing a component (which happens when si (z) = 0 for some z ^ 2), choose the next 
vertex to explore in a size-biased manner from the unexplored set ^(z). If '%'(/) = 0, then we 
have finished constructing the partition of the graph into the connected components. 

Now note the following important properties of this exploration: 

(a) The ordering (z;(l), zz(2).zz(zz)) is a size-biased reordering of the vertex set [zz]. 

(b) If we start a new component at some stage z with vertex uii), and finish exploring the 
component at stage j ^ i, then the walk satisfies 


z(Tj) = z(r,_i) -Xy( 


Z(zz) ^ Z(Tj) on Ti-i <u< T;. 


Thus, the size of the component of v{i), £ . Xy[i) is essentially the length of the excursion 
of the walk beyond past minima. 

As a starting point in proving Theorem |1.7[ Aldous and Limic 110 show the following re¬ 
sult. Their result is more general (incorporating the presence of a “Brownian component”) 
but we state their result as applied to our setting. 


Proposition 5.2 (|10 


Z„(s) := Z(s)la 2 - Then under Assumption 


Proposition 9]). Co nsid er the process {Z„(5): 5 ^ 0} defined by setting 

Zn as n—>- oo. 


1.6 


Using this result, Aldous and Limic |[^ show that the corresponding maximal excursions 
beyond past minima of Z„ also converge to the maximal excursions beyond past minima 
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of V^, namely the excursion lengths of the reflected process (see i jl.lGI I) from zero. A 
consequence of the proof of Theorem |1.7| in using Proposition |5.2| is the following result: 

Lemma 5.3. Fix K and letSn iK) be the time required for the above construction to explore the 
maximal K components :l^i ^K}. Then :K^l} is tight. 


In other words, for any fixed K^l, the maximal length excursions of are found in finite 
time. Thus, even though the total weight of vertices cri ^ oo, when exploring the graph in 
size-hiased fashion, under Assumption 1 1. 6| one needs only a finite amount of “time” to find 
the maximal components. Here time is measured in terms of the weight of vertices already 
explored. Now define 

Sn.2iu)^ ^ f^] , Kiu):^ ^ ^]l{x,a)<CT 24 . 

Thus, Sn.zit) is the normalized sum of squares of vertex weights of vertices explored hy time 
t and Rfi is the normalized sum of these squares where we only retain explored vertices with 
weight at most eaz- Using the same set of exponential random variables :j^l} that 
arose in the definition of the process V'^ in ( |1.151 l define a new process 


Soo.2iu) := Y, cjlll^j ^ u}. 
i=i 

The same proof techniques as in now implies the following. Since the ideas basically 
follow from we only sketch the proof. 

implies the joint convergence of the processes (Z„(-),S„,2(0) 

(I^‘'(-),Soo,2(-)) asn^oo. 


Lemma 5.4. Assumption 


1.6 


Proof: Fix K ^ I, and for each i ^ 1, let denote the time when vertex i is added to the 
collection of active vertices. Now consider the + 1 dimensional stochastic process 


Y^(s):= 


Xi . 

Zn{s),—t 

CTz 


. 


s^O. 


Write 


tK 


is) := (y"(s),ci]l{<ri ^ S},...,CKi{^K^S}). 
Aldous and Limic showed that 


In the proof of Proposition |5.2[ Aldous and Limic showed that Y^ Y^ for every fixed 
K ^ 1. Thus to complete the proof, it is enough to show, for every fixed A > 0 and 
q > 0, limsup£^olinisup„^oQP(i?^(A) > q) - 0. Now as described on jlO Page 17], we 
can couple ^ 2 ”^.with a sequence of independent exponential random variables 


.with having rate Xj/az such that ^ 

^ Z. J J J J 




?(n) 


tin) 


in) 


. Now write 


Kit)--- Y 


j:xj<Ea2 ^2 


limsup limsup E(.R^(A)) = 0, 

£^0 n^oo 


Then it is enough to show 
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'^3 


^ E 


]:Cj<E 


which is trivial since 

E{R^„iA))^A Y. 

j-.Xj^eaz 

We have used both ( |1.121 l and l |1.131 l in the last convergence assertion. Thus, first letting 
n^oo and then e ^ 0 completes the proof. ■ 

We can now complete the proof of Proposition |5.1[ First, note that to prove l |5.11 l, it is 
enough to show that for any two rationals r < s, Y.j Cjl{r ^ ^ j ^ s] - oo almost surely where 
are the associated exponential rate Cj random variables. This, however, is trivially true as 
Ljcj = oo. 


To prove the other assertions, define, for i ^ 1, the point processes := {x„/cr 2 : u e 
namely the rescaled vertex weights in the ith maximal component. Analogously define = 

{Ci,: V e 2i}, namely the collection of jumps in the ith largest excursion of V'^. Let 

^2 


M) 


rr2’ 


and 5® := ^ i 
veZi 


for the normalized sum of squares of vertex weights in a component. Define 
:= ((massj^d, 5®, '^n] l), ^co := {{Zi, 5®,E®), l). 

We will view these as random elements of where S: = [R^ x (IR). Lemma ls^ and Lemma 


5.4 now imply the following: 


Lemma 5.5. Asn^oo, SAn 


on S" 


Expressing the functionals that arise in Proposition |5.1| in terms of vertex weights in max¬ 
imal components completes the proof. Indeed, 


n(p®) 


(O', _ 


^2\ 




0 , 


asn^oo. The proof of —► ^oo is similar. ■ 

5.2. Gromov-weak convergence in Theorem |1.2j That convergence in | |1.71 l holds with re¬ 
spect to Gromov-weak topology is an easy consequence of Theorem|1.8[ Indeed, setting 


t-2 1 - 

Xi - n 1^-1 Wi and = — (1 + An 

r n 




)n 


2 (t-2) 

T-1 


we can write NR„(u^(A)) as the model tn) where x = x®’ := (x,: i e [n]). A direct compu¬ 
tation will show that x®’ satisfies Assumption [l^ with the entrance boundary c“ defined in 
( |2.111 l. Note also that 


tn 


(72 (X'">) 


2(t-2) 

n T-i 






ie[n] 


inin 


Under the assumptions of Theorem 
flTl Lemma 2.2], 

i 2 

T ^ 

ie[n\ 


1.2 


£n/n^ EW and X/ uj^ln EW^ = E W. Further, by 


1 + Cn 




+ o{n 


-(t-3)/(t-1) 
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where (is as defined in ( |2.12t . Combining these observations, we see that 

tn - ((72(x‘"’))“^ ^ t™ as n^oo, 

where t™ is as in \2.\2) . Since ^ E VK, we conclude that M™(A) defined in 

( I2.131 I is the Gromov-weak limit of where M“(A) is as in | |1.6k 


Remark 7. Theorem 1.8 is stated for a fixed A g IR, but in the argument just given, we have 
to work with a sequence, namely tn - (cr 2 (A:'"’))“^ converging to This, however, does not 
make any difference. Indeed, the proof of |[^ Proposition 9] can be imitated to prove the 
same result in the setup where we have a sequence converging to t instead of a fixed t, and 
no new idea is involved here. (In Lemma 27], Aldous and Limic prove a similar result for 
the multiplicative coalescent. They do not, however, explicitly state the convergence of the 
associated process under the same assumption.) 

6. Proofs: Convergence in Gromov-Hausdorff-Prokhorov topology 


In this section, we improve Gromov-weak convergence in Theorem 1.2 to Gromov- 


Hausdorff-Prokhorov convergence. To do so, we will rely on |[^ Theorem 6.1] which gives 
a criterion for convergence in Gromov-Hausdorff-weak topology. We do not give the defi¬ 
nition of Gromov-Hausdorff-weak topology and instead refer the reader to Definition 
5.1]. Convergence in Gromov-Hausdorff-weak topology implies convergence in Gromov- 
Hausdorff-Prokhorov topology when we are working with metric measure spaces having full 
support (i.e., the support of the measure is the entire metric space). This is true in our sit¬ 
uation. Indeed, it is a trivial fact that (A) has full support. Further, the mass measure on 
an inhomogeneous continuum random tree has full support which implies that the same is 
trueforM“(A). 

Applying Theorem 6.1] to our situation, we see that it is enough to prove the following 
lemma: 

Lemma 6.1 (Global lower mass-bound). Let be the ith largest component of 

NR„(u/(A)). Then the following assertion is true: For each i ^ I, v e [n] and 6 > 0, letB{v,5) 
denote the intrinsic ball (in NR„(m/(A ))) ofradius5n^^~^^^^^~^^ around v and set 


m5."^(d) =inf^ ^ 

I jeBiv.d) 


Wj 


ve%a) 


Then the sequence | is tight. 


Lemma 6.1 ensures compactness of the spaces MH(A) which, in turn, implies compact¬ 


ness of the spaces Mf (A) when c = (ci, C 2 ,...) is of the form l il.191 1, thus proving the first as¬ 
sertion in Theorem ll.9[ 


Before moving on to the proof of Lemma [6Hj we state a result that essentially says that 
instead of looking at the largest components, we can work with the components of high- 
weight vertices. This observation will be used to prove the global lower-mass bound: 

Proposition 6.2. For every e > 0 and k^\, there exists K - K{e,k,X)>b such that 

P([k'] nh^’iiX) - 0 forsome \ ^ i ^ k] 

Proposition |6.2| follows trivially from Theorem 1.6 (a)] and 1^ Theorem 1.1]. 
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6.1. Bound on size of ^^-nets for the largest components. For convenience, we 

set 

77 = (t-3)/(t- 1) and p = (t-2)/(t-1). (6.1) 

The purpose of this section is to prove a strong result (Proposition |6.3| stated below) that 
gives control over the number of intrinsic balls of radius en^ needed to cover the largest 
components. This acts as a crucial ingredient in the proof of Lemma |6.1| as well as the proof 
of the bound on the upper box-counting dimension. 

Proposition 6.3 (Small diameter after removing high-weight vertices). For every e,6 > 0, and 
N^Nie) 

P(diam(NR„(M7(A)) \ [N]) > en^] ^ c^exp , (6.2) 

for all n sufficiently large, a positive constant cg depending on 6 and a universal constant 
C > 0. Here NR„(m7(A)) \ [N] denotes the graph obtained by removing all vertices with labels in 
[iV] and the edges incident to them from the graph NR„(m7(A)). 

We continue to prove Proposition |6.3[ Write 

En = {diam(NR„(M7(A)) \ [N]) ^ en^}. (6.3) 

The proof consists of four steps. In the first step, we reduce the proof to the study of the 
height of mixed-Poisson branching processes. In the second step, we ensure that we can 
take A = 0, while in the third step, we study the survival probability of such critical infinite- 
variance branching processes. In the fourth and final step, we prove the claim. 

Comparison to mixed-Poisson branching processes. Let ^res(*) be the cluster of i in the 

(restricted) random graph on the vertex set [n] \ [i - 1] with edge probabilities qj^riwiX)) for 
k,£ e [n] \ [i - 1], where qicgiwiX)) is as in l |l.l| l. 

Note that the event implies the existence of i > iV such that the following happens: 
(a) The diameter of the component of i in NR„(m7(A)) \ [N] is bigger than en'^. (b) No 
j e {N+l,...,i-l} belongs to the component of i in NR„(m7(A)) \ [N], In particular, 
diam(^res(*)) ^ for this i. Thus, 

P(£^) ^ ^ P (diam(^res(0) > en^). (6.4) 

i>N 

Now the random graph NR„(m 7(A)) restricted to [n] \ [i - 1] is the Norros-Reittu random 
graph NR„(m/®(A)), where w'-'fX) = : j e[n]), - Wj[X)£n !£n for j e[n]\[i - 

1] and wfiX) - 0 for j e [i - 1], and £n - Wk- Indeed, this follows from the simple 
observation 

u7“(A)u7“(A) X\w]cW{ 

I”=,u;®(A) £n 

Write Wff (A) for a random variable whose distribution is given by {n-i + 1)“^ £”_ • 5 m,.,, 

J — I W J {A) 

and for any non-negative random variable X with EX > 0, let X° be the random variable 
having the size-biased distribution given by 

P(X°^x) = E(X]1(x^4)/EX. 

We will use the following comparison to a mixed-Poisson branching process: 
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Lemma 6.4 (Domination by a mbced-Poisson branching process). Fix i e [n] and consider 
NR„(m/‘'’(A)). Then, there exists a coupling of %esii) and a branching process where the root 
has aPo\lw^\A)) offspring distribution while every other vertex has aPo\{{W'f {X))°) offspring 
distribution such that in the breadth-first exploration of%esil) starting from i, each vertex 
V e ^res(0 has at most the number of children as in the branching process. 

Proof: See |5^ Proposition 3.1]. ■ 

It immediately follows from Lemma [6^ that 

P (diam(^res(z)) > en^] ^ P (ht(r„“’(A)) > en^l2^ , (6.5) 

where r™(A) is a mrxed-Poisson branching process tree whose root has a Poi(M;“(A)) off¬ 
spring distribution and every other vertex has a Poi((VP“(A))°) offspring distribution. As 
before, ht(t) denotes the height of the tree t. 

When ht(r^’(A)) > en^l2, at least one of the subtrees of the root needs to have height at 
least en^l2. Combining this observation with ( |6.4] l and l |6.5t , we get 

J^E[Po\{wfU))]P{ht{Pfa)]^en^l 2 ]^ ^ u;®(A) P (ht(r®(A)) ^ en''/2)C6.6) 

i>N i>N 

where r“(A) is a branching process tree where every vertex has a Poi((W“(A))°) offspring 
distribution. 

We make the convention of writing r“, W“ etc. instead of r“(0), W;^‘’(0) etc. With this 
notation, it is easy to see that W“(A) = (l-i-An“'')W“ and hence (W“(A))° = (l+An“'^)(W“)°. 

The survival probability of mlxed-Poisson branching processes. We would like to compare 
our mixed-Poisson branching process with an offspring distribution that is independent of 
n. For this, we rely on the following two lemmas: 

Lemma 6.5 (Mixed-Poisson branching processes of different parameters). Let Tff and Tff (A) 
be as above. Assume further thatX ^ 0. Then, for each k^l, 

P (ht(r® (A)) ^k)^a + Xn-^)^- P (ht(r®) ^ k]. 

Proof: We follow Proof of Lemma 3.4(1)]. Writing 5 = 1 + Xn~'^, we note that we can 
obtain T® as a subtree of T® (A) by killing every child independently with probability 1 - 5“^ . 
Write sd for the event in which ht(r® (A)) ^ k and no vertex in the leftmost path of length k 
starting from the root in T® (A) is killed. Then 

= 5“^P(ht(r®(A)) ^ k]. 

Indeed, the probability of the leftmost path surviving is precisely 1/5^. To finish the proof, 
note that sd implies ht(r®) ^ k, so that 

P (ht(r®) ^k]^ = 5“^P (ht(r®(A)) ^ k ), 

which is the desired inequality. ■ 

Lemma 6.6 (Stochastic bound by n-independent variable). Under Assumption \l.l\ the ran- 
dom variable {W„)° is stochastically upper bounded by W° where W ~ F, i.e., (W„ )° ^ LF°- 
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Proof: First we make the following elementary observation: if ai,a 2 , hi, hz are positive num¬ 
bers such that 

Ui a2 , Ui ai + a2 0,2 
-r^-T’ then — ^ ^ . 

hi h 2 hi hi + h 2 h 2 

Repeated application of the above will yield the following simple inequality: if and 

{h„}„^i are sequences of positive numbers satisfying 


U\ 0,2 CI3 Cli U\ + 0,2 + <^2 <^3 

— ^ ^ ^ , then — ^^^ , 

hi h 2 hs hi hi + b2 bi + b2 + ha 


(6.7) 


Recall that l denotes the leftmost point of the support of F, and note that from l |1.21 l it fol¬ 
lows that /“ / = j/n, j = 1,2 . n (note also that Wn - <-)■ Define the function h„ : [l, Wi) 

{ 1 , 00 ] by f =l/n. This immediately implies 

fihniy)W^iy)^fiy). (6.8) 

Let gn : [i, Wi) (0,oo) be given by 

y 


gniy) 


uf{u)du 


A direct computation and an application of ( |6.81 yields 

a hniy) ,2 rhniy) 

ufiu)duj g^(y)= J uf(u) du-yf(y)[h„(y)-y}. 

Since ufi u) is non-increasing on [i, 00 ) under Assumption ! l.l[ we conclude that (y) ^ 0 on 

(f, Wi). Thus, g„(-) is non-increasing on [i, Wi). By right continuity, we can define gniu>i) - 
uf{u) du). Since Wn ^ Wn-i ^ ... ^ Wi, we conclude that gniu>i) ^ gnit^z) ^ ... ^ 
g„(M/„). Clearly h„{wj) = wj-i for 7 = 2. n. Thus 

Wi W2 Wn 

Iwi du " ufiu) du"" 

Now an application of l |6.7| gives 

Wi + W2 + ■■■ + Wjc ^ Wi + W2 + ■■■+ Wn 

/“ uf{u) du 


/, " ^ ufiu) du 


IZufiu) du 


, k= 1,2 . n, 


which is equivalent to 

P(«>)° ^ Wk) = + + ^ /^^u/(u)du ^ ^ k=l,2,...,n. 

Wi + W2 + ...+ Wn ufiu)du 

This concludes the proof. ■ 

We continue to study the survival probability of mixed-Poisson branching processes with 
infinite variance offspring distribution: 


Lemma 6.7 (Survival probability of infinite-variance MPBP). Let T denote a mixed-Poisson 
branching process tree with offspring distribution Poi(W°). Then, there exists a constant 
such that for all m^l, 

P(ht(r) ^ m) ^ 
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Proof: This is a well-known result. We sketch the proof briefly for completeness. Recall the 
following facts about W°: (a) E[W°] = v = 1 and (b) forx^ oo, P(W° > x) = cx“^^“^Hl-to(l)). 
By the Otter-Dwass formula, which describes the distribution of the total progeny of a 
branching process (see for the special case when the branching process starts with a 
single individual, for the more general case, and ||4^ for a simple proof based on induc¬ 
tion), we have 


P(|r| = A:) = 


£x, = A :-1 , 


U'=i 


where Xi are i.i.d. random variables distributed as W°. By ||^ Proposition 2.7], in our situa¬ 
tion, Xi - k - 1) ^ so that 

= and Pi\T\^ k) ^ (6.9) 

Take k = second inequality in I l6.9] l to get 

P(ht(r) ^ m) ^ + P(ht(r) ^m,\T\^ , 

where j Tj denotes the total number of vertices in T. We condition on the size j Tj and write 


,(t-2)/(t-3) 


P(ht(r) ^ m, jrj ^ m 


(t-2)/(t-3) 


)= Y. P(ht(r) ^ m I jri = A:)P(|r| = k) 


k=l 

^(t-2)/(t-3) 


Y P(ht(r) ^ m I jrj = A:) A:' 

fc=i 


T-1 

■ t-2 


( 6 . 10 ) 


By 150 Theorem 4], there exists a 7C> 1 such that, uniformly for u^l, 

P(ht(r) ^ I jrj = A:) ^e“““". 

Combining this with dO.lOj l, we get 


t-2 

mT-3 


i f V ^ ^ t3 \ ^ \ T 1 

’(ht(r) ^ m,jrj ^ ^ Y exp|-a|mA:“^| |A:“^=0( 

fc=i ^ ^ 


m 


-1/(t-3) 


)- 


as required. 

Proof of Proposition |6.3| Clearly 

P (ht(r®) ^ m) ^ E [(W®)°] P (ht (r®) ^ m - 1) =: v® P (ht (T®) ^ m - l), 

where 


v® = E[«>)1 = 


I;> / (wff Zj^i 


W; 


^n I Zj^i ^ j 


(n 


Iterating this euP!A times, we get 

P(ht(r®) ^ en''/2) ^ (v'^Y^^'^P (ht(r®) ^ en''/4) 


( 6 . 11 ) 


^ (v®)^"’''^P(ht(r) ^ en''/4) ^ (v®) 


(iKe«''/4 


/ .xI/(t-3) 






where the second inequality is a consequence of Lemma |6^ and the last step follows from 
Lemma [6771 
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Substituting the estimate ijeTT) into 1 16.61 1 leads to 




1 + 


max{A,0} 

^ j 


, l+en ’^12 


L 

i>N 


(iUEn ’^14 


( 6 . 12 ) 


for some constant c. Here we have used Lemma 


6.5 


and the simple fact that w . ^ wt. 


Next, note that it is an easy consequence of I ll.St that there exist constants c', c" > 0 such 
that for all i^[n\, 


uy^c(-) 


and 


7=1 7=1 * 


(6.13) 


Further, 17 Lemma 2.2] implies that v„ < 1 for large n. Hence, for every i ^ 2, 

< = < - ^ L ^ 1 - ^ exp {-Cn-^f) 

j=i 

for some C > 0. Here, we have used the second inequality in I l6.13| . Combining this estimate 
with 1 16.121 and the first inequality in I l6.13h we end up with 

P(£^) ^ ^ r^'^^“^’exp(-Cei''/4) 

i>N 

for some C' > 0. Taking N - we arrive at 

P(£^) ^ Y. exp(-Cei''/4) 

i>N 


^ ^ ^ exp(-Cei'^/4) 

k=0 i=N2<^ 

oo . . 

^ £ 2^exp (-Ce(iV2^)'’/4). 

fc=o ^ 


(6.14) 


Note that eW -e A little more work after plugging this into 1 16.14^ will lead to I l6.2k 


6 . 2 . 

6 H 


Proof of global lower-mass bound. In this section, we complete the proof of Lemma 
We start with some preliminaries: 


Lemma 6.8 (Weight of size-biased reordering). Letni,il) - v andin^ii): i g [n]\{l}) beasize- 
biased reordering on [n] \ {v} where the size of vertex v' is proportional to Wyi for u' e [n] \ {v}. 
Then, for every k - o(n), there exists aj>0 such that 

k 

P 3z7: Y ^71 Ai) ^ ^ ne“^^. 

^ /=! ' 

Proof: See flT} Proof of Lemma 5.1]. ■ 

Recall the definitions of p and p from 1 I 6 .II 1 . Recall that for v g [n], B{v,5) denotes the 
intrinsic ball (in NR„ (u^(A))) around v or radius 5n^. We will use the following bound on the 
weight of balls: 
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Lemma 6.9 (Weights of balls around high-weight vertices cannot be too small). For every 
e > 0 and i ^ 1, there exist ni,e large and 6i,E > 0 such that for all n ^ nt^e and 5 g (0 , dj,e]. 


p E wj 



^ nexp 


J6nP \ e 

ji/(T-i) j~'~ 2^' 


(6.15) 


Proof: We rely on a cluster exploration used in which we describe next. We denote 
by (Z/(i))/^o the exploration process of ^(i), the cluster containing i, starting from i, in 
the breadth-first search, where Zo(i) = 1 and where Z^i) denotes the number of potential 
neighbors of the initial vertex i. The variable Z/(i) has the interpretation of the number of 
potential neighbors of the first I explored potential vertices in the cluster whose neighbors 
have not yet been explored. As a result, we explore by taking one vertex of the ‘stack’ of size 
Z/ (i), drawing its mark and checking whether it is a real vertex, followed by drawing its num¬ 
ber of potential neighbors. Thus, we set Zo(i) = l,Zi(i) = Po\iwi), and note that, for I ^ 2, 
Ziii) satisfies the recursion relation 


Z/(z) - Zi-iii) + Xi — \, 

where Xi denotes the number of potential neighbors of the /th potential vertex that is ex¬ 
plored, where Xi - Xdi) - Poi {Wi). More precisely, when we explore the /th potential vertex, 
we start by drawing its mark Mi in an i.i.d. way with distribution 


When we have already explored a vertex with the same mark as the one drawn, we turn the 
status of the vertex to be explored to inactive, the potential vertex does not become a real 
vertex, and we proceed with the next potential vertex. When, instead, it receives a mark 
that we have not yet seen, then the potential vertex becomes a real vertex, its mark M/ e [n] 
indicating to which vertex in [n] the /th explored vertex corresponds, so that Mi e'rfii). We 
then draw Z; = Po\iwMi), andZ/ denotes the number ofpotential vertices incident to the real 
vertex M/. Again, upon exploration, these potential vertices might become real vertices, and 
this occurs precisely when their mark corresponds to a vertex in [n] that has not appeared in 
the cluster exploration so far. We call the above procedure of drawing a mark for a potential 
vertex to investigate whether it corresponds to a real vertex a vertex check. Let 

Z^tnP]il) for t>0. 

Then, by imitating the techniques used in the proof of |[^ Theorem 2.4], we obtain 

( flT) Theorem 2.4] states the result for / = 1. However the exact same proof goes through for 
any i ^ 2.) The limiting process (.Zf (/)) t>o is defined as follows: Let 



a = c]'^^“^Ve[W] and b = fi(/) = 

(6.16) 

We let (^/(f))zs 

,1 denote independent increasing indicator processes defined by 



J^iis] - ]l{Exp(a/”^^^’^”^’) G [0, s]}, 0, 

(6.17) 

so that 

PiJ^iis) =0 VsG [0, t]) =exp(-at//^^^’^”^^). 
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Here (Exp(ai independent exponential random variables with rates 

Then we define 


t/ 

- b - abt + ct+Y —77 — 7- 
J 




at 




J 


(6.18) 


for all t ^ 0, where c - A + ( - ab and (is as in | |2.12t . We call t^o a thinned Levy process. 


Let denote the hitting time of u of the process (i)) f>o. Then, by 17 Corollary 


3.4], —>• the hitting time of u of the process (^f(0)f>o- 

existence of a (independent of n) and an integer such that 


This implies the 


^ B,j) ^ e2-' for n ^ 


(6.19) 


since the limiting process (^f(i))f>o starts from {cp !and takes a positive amount of 
time to reach 

Let \B{i, r) \ denote the number of vertices in 5(i, r). Let dg,, be so small that 

{Cpl2i)^'^'^-^^6p,i<Bp,i. ( 6 . 20 ) 


Then we claim that for all 5 e (0, dpj], 

p(|B(i,d)| ^ ^ ^ 


( 6 . 21 ) 


That ( I 6 . 21 I 1 holds can be seen as follows. For |B(i,d)| ^ [Cpl2i)^'^'^~^^5nP to occur, there has 
to exist some je [l,dn''] such that the number of vertices at distance jfromiis smaller than 
{Cpl2i)^'^'^~^^5nP!{5n^), i.e. 


min \dB{i,in ^)\^{Cpl2i) 

1 ^ j^Sn^ 


1/(T-1)^1/(T-1) 


( 6 . 22 ) 


Now the number of vertices at distance j from i is precisely the number of vertices in genera¬ 
tion j of the breadth-first exploration process, and hence this number (scaled by n^) appears 
in the function JZ'f"’(i). Thus, l |6.22| l implies that (2‘f"’(/))f>o has to hit {Cpl2i)^''^'^~^^ before we 
have finished exploring up to generation dn'', i.e., we must have that 


je^;\{cpi2iY'^^~Y 


|5(Ld)| 


nP 


m 


d < Be^u 


where the last inequality holds by l |6.20] i and because d g (0, de,d. 

Combining ( |6.19| and l |6.21] l, we conclude that for all d g (0, d^^d and n ^ 


p(|5(i,d)| ^ (c^/2/)^'^^“^’dnP) ^ e2~\ 


(6.23) 


This explains the second term in | |6.15k 
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To see what happens when \Bii,6) \ ^ recall that the vertices appear in a 

size-hiased fashion in our exploration process. Hence 




(6.24) 



VH/P-i) SnP] 

E ^ 7 r,( 7 )^[- 

7 = 1 ^ 

7) 2 j 


^ nexp 


J6nP 


ji/(T-i) y 

hy Lemma |6^ Combining I l6.23t and l |6.241 l proves the claim. ■ 

Lemma 6.10. For v e [n], leF^iv) denote the component of v mNR„(u/(A)). Then for every 
fixed i ^ 1 and ei ,£2 > 0, there exists - ^ integer h - hy^E2 


( min y Wj\ ^ £n^\ ^ 62 for n'^ n. 


Proof: Recall Proposition |6.3[ and choose ^ei,e 2 large so that 

P(diam(NR„(M;(A)) \ ^ £in^/2) ^ 1 - £2 


(6.25) 


for all n ^ nE^,E2- L^t 


Fi = {diam(NR„(M/(A)) \ [^£^^£ 2 ]! ^ ein^l2\ and F 2 - {diam('^^(i)) > ein^ll}. 
Clearly, on the set Fi n F 2 , 


mm 


E 


Recall the definition of 5E,i in l |6.201 l, and let 

A 

Then | |6.26t implies 


Wi > ^ mm 


E 


fce[A?£i,£:2l [jeBik,Eil2) 
Ei,E2 - £1 t\ |d£2,l A • • • Ad£2,iV£:j,£2) ^2. 

^ Vj ^ min ^ 

jeB{v,Ei) jEB{k,Aei.s2) 

on the set Fi n F 2 . Hence, for all n ^ n£i,£ 2 , 


Wj 


(6.26) 


mm 


Wi 


Fi n F 2 n t min < Y 

\jeB[v,Ei) j 


Cp 


,2Nej^, 


E2 




Aej,e2 


(6.27) 


F^i.e2 , / r \ 

IP L 




fc=I jEB[k,A^,i-^) 


F^1,E2 

^ E 

k=l 


nexp - 


/A£, £, n^ 


^Ei,E2‘ 
aH/It-I) 
^^Ei,E2 




^ n exp - - 


/A£, £2 n^ 


iV, 


El,£2' 
1 /(T-I) 
El,£2 


+ £2> 
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where the second inequality is a consequence of Lemma |6.9[ 
Next, on the set Fi n F^, 




Wl 


jeBiv.ei) 

for any v e Further, hy [T^ Theorem 1.4], n~PY.jE^[i) converges in distribution to a 

’£2 


positive random variable. Hence, there exists > 0 such that 


limsup P Fi nF|n 


jminf X 


(6.28) 


^ limsup 

n—►oo 


E 


Wj^^f^nP\^e2. 


The result follows upon combining | |6.25t , ( |6.27t and l |6.28t . 

We are now ready for the proof of Lemma f 
Proof of Lemma 6.1| Using Proposition!^ 
that 


P('^/(A)n[F] = 0) ^e/2. 

By Lemma[6H0! we can choose ^ > 0 and an integer n such that 


6J_ 

for any i ^ 1 and £ > 0, we can choose K such 

(6.29) 


min ( Y u;,] ^ I ^ £/(2J<Q 


(6.30) 


for all n ^ n and A: g [Fj. Combining | |6.29t and l |6.30] l, we see that 


P |m|.”’(d)j >1/^1 ^£ for n 

which yields the desired tightness. 

7. Proofs: Fractal dimension 


^ n, 


In this section, we prove the assertions about the box-counting dimension. Throughout 
this section, C, C' will denote universal constants whose values may change from line to line. 

We first prove a similar result for the component of j, ^(j). Consider ‘i^(l), and as 
usual, view "^(l) as a metric measure space via the graph distance and by assigning mass 
Pi, Wi,l[Y.ee^[i) PPe) to vertex v g ^(1). Set p := [pu : v g ^(1)). Now note that condi¬ 
tional on the vertex set of ‘^^(l), ^(1) has the same distribution as the graph #to(P> <^) where 
a - (1 + Wj)^l£n- Using jl7 Proposition 3.7] and (Tt! Lemma 3.1], it is easy to 


verify that the conditions in Assumption 4.4 hold with this choice of a and p. Thus, by Theo- 


rem |4.5[ n“'^'^^(l) converges in Gromov-weak topology to a limiting space that we denote by 
^{1). Further, the sequence {«”^'^(l)}„^i satisfies the global lower mass-bound property 
by Lemma[6.10[ Hence, 


n 




(7.1) 


with respect to the Gromov-Hausdorff-Prokhorov topology. By similar arguments, we can 

show that n~P'^[i) Ji (j) with respect to the Gromov-Hausdorff-Prokhorov topology for 
any j and an appropriate (random) compact metric measure space ^ij). In Section[7Hj 
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we identify the upper box-counting dimension, and in Section [7^ the lower box-counting 
dimension. 

7.1. Upper bound on the Minkowski dimension. The key ingredient in the proof is the fol¬ 
lowing lemma: 

Proposition 7.1. Write tt = (t - 2) / (t - 3). Then for every j ^ I, 

P^dimi^ij)) > 7rj = 0. 


Proof: For simplicity, we work with 7 = 1. The proof is similar for any j ^ 2. Recall that 
6] denotes the minimum number of open balls of radius 5 needed to cover the com¬ 
pact space Write 


and (7.2) 

Since the convergence in ( |7.11 l holds with respect to the Gromov-Hausdorff topology, for ev¬ 
ery x,e > 0, 


’ (91(00) (2£) > x) ^ limsup P (91(„)(£) > x). 


Fix an arbitrary d > 0 and, for any £ > 0, define 


Xe := £ 


-S-n 


M£:=|log£|, 5'-.= - 


T-l 

,t-2 


and N{e) = E 


-S'-Hr] 


(7.3) 


(7.4) 


Let En be the event defined in | |6.3k Clearly, on the event En n {91(„)(£) > Xe\, any v e^{\) is 
within distance en^ from a point in ^(1) n [iV(£)]. Hence, 

P(9I(„,(£) > x^) ^ P{E%) + P(|^(l) n [iV(£)]| ^ X£), (7.5) 

and, by Proposition|6.3[ 


limsup P{Ef) ^ c^/cxpf-C/e^'']. 

n ' ' 


(7.6) 


It remains to bound P (|^(1) n [Nie)] \ ^ Xe). To this end, note that by Proposition 3.7], 


N{e) 


(7.7) 


n [iV(£)] I ^ E ('^^(1)(0)) - 

q=l 

where and are as defined around ( |6.18k Further, 145] Theorem 1.4] implies 

the existence of positive constants Ai and Az such that 

P(^j?’(i)(0) > Ue] ^ AiexTpi-A2ul~^). (7.8) 

Combining 17.51 , 17.6} , 17.71 and 17.8} , we conclude that, for any Ue > 0, 

limsup P(91(„)(£) > Xe) ^ C^^expl-Ce”^’^]-t7liexp(-7l2ME^)+ 1^ E ^q^^e^ ^ Xe .(7.9) 

n ^ ’ U=1 ) 


Now (Ue) are i.i.d. Bernoulli random variables with 

PiJ^q (Ue) = 1) = 1 - exp =: Pq’ 

where a is as in 16.161 . Choose s > 0 small so that e^ - 1 ^ 2s. Clearly 

Eexp(sJ^^^(We)) = l + p^y(e^-l) ^exp(p^(e^-l)) ^exp(2sp<^). 
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Hence, there exists a constant A 3 > 0 such that 

(N[e) 


me] 


Y, iUe) ^ Xe U exp -SXg +2sYPq\ 


(7.10) 


\q=l 


q=l 


^ exp(-5X£: + 25A3MeiV(£)^) 

= exp|-SXe + 2sA3M££”2“^j . 

Comhining I l7.3h l |7.9| l and ( |7.101 l, we see that P' ((^(^ 0 ) {2/k) > < 00 . Since d > 0 was 

arbitrary, we conclude that 

log (91,,., ( 2 /A:)) 

limsup -;-;- ^71 U.S. 

k ^ log(A:/ 2 ) 


By sandwiching e between 2/ (A: -1) and 21k, we get the desired upper bound on dimi^ (1)). 


Proof of upper bounds in l |1.8| l and l |1.201 l; We only give the proof of l |1.8| l. This will imply 
( 11.201 1 because of ( |2.131 l. Fix i ^ 1 and let 


Kn:=mm{j e[n]: je%U)}- 

By Proposition |6.2[ Kn is tight. By passing to a subsequence if necessary, we can assume that 
we are working on a space where 


(n-''M“(A),ii:„)-(M”(A),^:oo) a.s. 


for some (integer-valued) random variable Koo. Then 

P |dim (M“(A)) > 7rj = ^ P |dim (MH(A)) > n, Koo = 7 j • 

7=1 

P |dim (MH(A)) > 7T, Koo = j) = 0 for every j ^ 1 , and hence 

dim(M™(A)) ^ :;r a.s. (7.11) 

This completes the proof of the upper bound on the Minkowski dimension. ■ 


By Proposition|7.l| 


7.2. Lower bound on the Minkowski dimension. We next extend the argument for the up¬ 
per bound to prove a lower bound on the Minkowski dimension of ^ ( 7 ). As in ( |7.3h 

IP (91(00) (e/2) < x) ^ lim sup P (91(„) (e) < x). 

n 


Recall the definitions in | |7.2h and for an arbitrary d > 0 and e > 0, adapt l |7.4t to 


Ae := £ 


8-71 


6':- — il-h), and 
n 


me) = e 




where tt = (t - 2) / (t - 3) as in Proposition|7.1[ and > 0 is sufficiently small so that 


K 3 := 2 -d-(l-d') 


(3t-8\ t-2 


, r-3 


+-> 0 , and K 4 := 1 - d - (1 - 5') 

T -3 


(2t-5\ t-2 

+-- >CT.12) 


t-3 


t-3 


(A simple calculation will show that it is possible to choose h> 0 small so that l |7.12| | holds 
whenever t > 3.) 
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The main result in this section is the following estimate on {e) := 

Proposition 7.2. There exist k > 0 and c> 0 such that 

limsup < x^) ^ ce’^. (7.13) 

n 

Consequently, for every j ^ I, 

P (dim (.y^ ij)) <7 t]-0. 


The rest of this section is devoted to the proof of Proposition |7.2[ As in Section [7d] and for 
simplicity, we work with 7 = 1. The proof is similar for any j ^ 2. Before starting with the 
proof, we collect some preliminaries. The proof helow relies on two asymptotic hounds on 
|^(1)|. For this, we use 

limsup P(n”^|^(l)| ^ s) = ^ s), (7.14) 

n 

where (Ois defined around ( |6.18| l . Our main result on the lower tails of the distribution 
of ^^( 1 ) (0) is in the following lemma: 


Lemma 7.3 (Lower tails of ^^[i) (0)). There exists C > 0 such that 

P(^.:^(i)(0)^5)^C5. (7.15) 

Proof: We note that 

P(^^(i)(0)^5) = P(3t^5: 5^f(l) = 0). 

We split 

b 

- b-abt + ct + (7.16) 
a 

where, abbreviating dj - a! 

OO OO 

@f= Y. dj[Njit) 

j>2 j>2 


Here (Wj(t))f^o are independent rate dj Poisson processes. Thus, (^f)fso is a Levy process, 
while (@f) f^o subtracts the multiple hits. When b>0 and t ^ s with s small, and using that 
is non-decreasing, 

P (^.5^(1) (0) ^ 5 ) ^ PI inf^^f ^-a/4j-tP(@5^ a/4). (7.17) 


We start with the latter contribution. Since, for a Poisson random variable Z with parameter 
A, 


E[(Z-1)]1{z>2)] = 

k>2 


¥ 


1 A^-2 A2 

A y re ^^ , 

(k-2)! 2’ 


’ ^ a/4) ^ -E[@d djidjsf 


a 


a 


t>2 


a 



we have 
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For the first term in l |7.171 l, we use Doob’s L^-inequality to bound 

16 16 “ 16 “ 


[ inf Skt ^ -a/ 4 ] ^ ^ X rf/Var(iVj(s)) = ^ ^ (fAdjS) = ^ X 

We[0,5] > oA ^ j>2 ‘^ j>2 f»2 ^ 


SO that (7.151 follows. 


Lemma 7.4 (Cluster weight convergence). For a set of vertices Aq [n], let wiA) - Y.aeA 
denote its weight. Then, asn^ 00 , for every j ^ 1, E[n~P wlffij))] remains uniformly bounded 
asn^oo, where p is as in 


Proof: Fix K^Oso large that 


V 


(^^+1)^ ...2, 1 a 


f ^ 1 

ie[n]\[K] ^ 


+ |A|a-'? 


This is possible, since £n/n^E[W], while 


p ^ I " mri/inn ^ i L-, 

!e[«]\[jir| 


E[W]n 
^l-C’n~^K'^, 


(7.18) 


(7.19) 


i^K 


where we have used l |6.13| in the second step to lower bound We write ^{A) - 

UaeA^t'^)- Then, for j ^ K, we bound 

wi^ij)) ^ wmim. 

We next investigate E[a;(^([i<f]))]. Note that 

Pij = ^ ^ + Aa-'')Ep', 

^ n 

where £„ - 'Lie[n] is the total weight. Thus, for any AQ[n\, 


P(dist(7l,7) = /)^ E E Y\Pis i,is’ 

aeA ii,...,ii_ie[n] ^=1 

where io - a, ii - j and the sum is over distinct vertices not in A. Using the bound on pij 
and performing the sum over f,..., z/_i, we obtain that 


^ Wn W j t 

P(dist([.f:],7) = Z)^(l + Aa-'?) E -^((l + Aa-’^)v‘„^ 




ae[K] 

Wj 


l-l 


Bv([7l^, 

As a result, for large n, 


^a + An~^)w{[K])-^ a + An~^)v 


(1 + An“'?)v‘yi^ ^ l-a“'?. 


w- 


(fc+i) 

n 


l-l 


E[u;(^([.f:]))] ^2a;([.f:]) 1+ E 7 ^ E^l “ 

je[n]\[K] 


i + Ed-^”'')^ 
1^0 


^^w{[K])n^. 


(7.20) 


^4a;([.f:]) 
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Since, by an argument similar to l |7.19| l, 




we arrive at 

E[M;(^([i<:]))] ^ = CRPnP. 

This completes the proof. ■ 

By ( I7.141 I and Lemma [Tlsj 

limsup V[\^{\)\^e^^'^nP]^Ce^^'^. 


We conclude that 

limsup < x^) ^ limsup < xj n {\'ioil)\> e^’^^^nP}] + (7.21) 

n n ^ ' 

We now study the event in | |7.211 i . We note that f)T(„) (e) ^ (e), which is defined as 


me] 


^ ~ 1 + ^ ll{;e^(l)}ll{dist<g>(i)(I,[j-I])>4e«'?}> 

i =2 


(7.22) 


where dist<^(i, (A, B) is the graph distance between the sets of vertices An^il) and B n ‘i^(l). 
Indeed, we start counting in the order i ^ 1, and determine whether an extra ball is needed 
to cover vertex i after we have covered the vertices in [i - 1] n . The first contribution in 
( 17.221 1 comes from the ball that covers vertex 1. 

Use inclusion-exclusion to write X‘"’(£) as 


X'"’(e) = X{"’(e)-X‘"’(e), 


where 


Therefore, 


me) 

^ l{Ie'^(I)}> 


1 = 1 


mA 

i =2 


p({91(„,(e) <x^}n{|^(l)| ^p({X{"’(e) <2x^}n{|'^(l)| >£'^^'^nP})+P(X'"’(£) >xj. 

(7.23) 

We will show that the limsup as n ^ oo of the first probability is bounded by Ce’^L and the 
limsup as n ^ oo of the second by Ce^'^ with ki,K 2 > 0, so that Proposition |7.2| will follow 
with K - mm{ 6 h/ 2 ,Ki,K 2 }. 


Analysis of XJ"'. It follows from 117 Proposition 3.7] that 


limsup P {X{"’(£) <2x^}n{|'^(l)| >£^^'2nP}UP(X^(£) ^2xJ, 

n^oo ' ^ 


(7.24) 


iy(£) 

Xi(£):= Y 

i=l 


where 
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is a sum of independent indicators with success probabilities 1 - exp( - i - 

1,.. .,N{e) (recall l |6.171 ), with dj as defined right below l |7.161 l. Note that 

N(e) NiE) NiE] 

E[Xi(e)] = ^ = 1) = [l -exp(-di/^'2)] ^ Y. ^ Ce^^'^N{e)P. 

i=2 i=2 i=2 

(7.25) 

Similarly, for small enough e > 0, 

N(e) N(e] 

E[Xi(e)]= Y [l-exp(-d,e'^^'2jj ^ ^ C'e^^'^N{e)P 

i=2 i=2 

Further, since (e) is a sum of independent indicators, 

Var(Xi(e))^E[Xi(c)]. 

Combining I l7.24| |, ( |7.25h l |7.26h and l |7.27| l, we get 

limsup P[{X'^^\e)^2x^}r\{\^{\)\>e^^'^nP}] 

n—►oo 

^ P(Xi(e) ^ 2xJ ^ p(|Xi(e) - ElX^ie)] | ^ x,) ^ xJ^VariX^ie)) 

^x-Mx^{e)]^Ce^\ 

where Xi = 2;r - 2d + dfi/2 - :n:(l - d') > 0 when d > 0 is sufficiently small. This proves a bound 
on the first term on the right side of ( |7.231 l. 


(7.26) 


(7.27) 


(7.28) 


Analysis of X^^\ We next give an upper bound on PiX^ (e) ^ x^). We start with 


'">( 
“l[rr vMi 


p(xf (e) ^ xj ^ x7E[X'"’(£)]. 


Further, 


iy(e) 


E[X^">(£)]= Y P(iG^(l),dist^(i,(i,[i-l])^4£n4 

i=2 '■ ■' 


(7.29) 


(7.30) 


When i e ^(1) and dist<^(y(i, [i - 1]) ^ 4enP, there has to be j e [i - 1] and k e [n] such that 
the three events 

(i) {dist(i. A:) ^4en^}; 

(ii) {dist( 7 , k) ^ AenP}] 

(iii) {A:g<^(1)}, 

occur disjointly, where dist(i, 7 ) denotes the graph distance in the random graph NR„(u/). 
There are two cases depending on whether k > N{£) ov k ^ iV(e). When k ^ N^e), we can 
ignore the event {distij, k) ^ 2en''}. This gives, for 2^ iV(e), 

G^(l),dist<^,i)(i, [i- 1]) 

NiE) 

L L ^ ({dist(i,A:) ^ den''} o {distij, k) ^4en''}o{A:e‘d’(l)}j 

■ ’ k>NiE) ^ ' 

Me) 

Y P|^{dist(i, A:) ^4en''}o{A;G^(l)}j, 


j=l k>N{E) 
NiE] 

+ 

fc=l 
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where, for two increasing events A,B, we write Ao B for the event that A and B occur dis- 
jointly. 

By the BK inequality, we hound 


me) 


“fi e‘^(l),dist<g>,i)(i, [/- 1]) ^ den''! ^ ^ ^ P(dist(i,A:) ^4en'')P(dist(7,A:) ^4£n’^)P(A:e‘^(l)) 

^ j=l k>NiE] 

N{e] 

+ ^ P(dist(i, A:) ^4£:n'')P(A:e'^(l)). 

k=l 


Similar to I 17.20L we have 

.. r A \ WiWj ^ ,, 

P(dist(G7) =/) ^ (l + —J—\ 

where v„(A) = (1 + Xn~^)Vn- In our case, v„ = 1 + 0{n~^), so that, for I ^ 4eiiA, 


P(dist(i, 7 ) 


IVi Wj 
£n ' 


Further, 


^ lAJ lAJ 

P(A:g^( 1)) = ^ P({ZG^(l)}o{A:Zoccupied})^]l{fc=i)+ ^ (l + —)-^^P(Zg^(1)) 


/e[w] 


le[n\ 


rfii £r 




n^i £ri 


where we recall that w{A) - Y.aeA denotes the total weight of A. By Lemma [7^ 
E[n~P wij^il))] remains uniformly hounded as n^oo. We conclude that 


“(z G '^(l),dist<g.(i,(Z, [Z - 1]) ^ den’ll 


iy(e) ,,..,,..,, .3 NiE) .2 

Y - ^ + CenP^P Y+ - 


/3 

7 = 1 k>N{E] 

NiE) 


k-2 * n 


el ' er 


^ (-^^2^2j7+p-3+5/(t- 1) ^ ^ -l/(T-l)y-l/(T-I)^-3 /(t-1) 


7 = 1 k>NiE) 
NiE) 


+ Y + .. i 


t;-1+2/(t-1) -1/(T-1) 


fc =2 


where the last step uses the first inequality in | |6.13| . Note that 

2i7 + p-3 + 5/(T-l) = 77 + p-2 + 3/(T-l) =77-1 + 2 /(t- 1) = 0, 
so that the powers of n cancel. Combining the above with ( |7.301 l leads to 


,jy(e) 


E[a:‘"’(£)] ^Ce^f Y 
1=1 


, n NiE) NiE) 

Y + CeY Y 


k>NiE) 


A:=l 


Note that 

N 

Y for p = 1,2, and Y 

7 = 1 k>N 
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Thus 


E[X“(e)] ^Ce^me) 


2(t-2)/(t-1) + (t-4)/(t-1) 


c 


e^Nie) 


(3t-8)/(t-1) 


+ CeNie) 


(t-2)/(t-1) + (t-3)/(t-1) 


Using ( |7.29| l and plugging in the values 77 = (t - 3) / (t - 1 ), tt = (t - 2) / (t - 3), we arrive at 


limsup P(X‘"’(£) ^ xj ^ Cx~^ 


e^Niey 


+ eN{ey 




where the exponents K 3 and K 4 are positive because of the choice of 6 ' (see l |7.12t ). 


Completion of the proof of Proposition |7.2[ Note that l |7.13t follows upon combining 
\7.2\) , l |7.23i , ( |7.28| , and l |7.31i . Now fix p > 1/jc, where k is as in | |7.13i . Then 
P (91(00) (1 / kf’) < {2k) < 00 . Since d > 0 was arbitrary, we conclude that 

. log (91(00) ( 1 /A:P)) 

limint- ^Tt a.s. 

k \og{kP) 

By sandwiching £ between 1/(A:- 1)P and l/k^, we obtain the bound: dim (^(l)) ^ n a.s. ■ 

Proof of l |1.8| and ( |1.201 l: Proposition |7.2| com bine d with an argument identical to the the 
one given right after the proof of Proposition |7. 1 1 yields the lower bound: dim(M™(A)) ^ 
n a.s. I |1.8| l follows once we combine this lower bound with l |7.11| l, and | |1.20t follows as a 
consequence of | [27T3) . ■ 


8. Open PROBLEMS 


In Theorem |1.8[ we have considered a general entrance boundary c g Zq. To study specific 
properties of the limit objects, we focused mainly on the special case c = c(a,T) as in | |1.19| l 
and in this case, we have shown compactness and identified the box counting dimension in 
Theorem 1 1.9[ An important problem in this context is to establish necessary and sufficient 
conditions on c that ensure compactness of the limiting spaces. 

Another moti vation for pursuing this problem comes from the following simple corollary 
of Theorem 1.9 For any i ^ 1, consider the sequence 0'"’ as in | |2.10t . Then is almost 


surely compact. Similarly, compactness of M (1) (as defined in | |7.11 l) implies compactness of 
the associated ICRT where 0 = {Of. i^l) is given by the following prescription: Let qk 
be such that 

Cfk 

^ ( 0 )) = k, 

^/=l 


where J^q{-) and ^^(i)(0 are as defined around l |6.18| l. Define 


-1/(T-1) 


Oi 




2/(T-l)j 


1/2 


for 


7>1. 


These can be thought of as “annealed results," since 0“’ and 0 are random. No result is 
known in this direction without a prior distribution on 0 , i.e., sufficient conditions on non- 
random 0 G 0 that ensure compactness of the tree are not known. In ^ Section 7], 
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Aldous, Miermont and Pitman conjecture that boundedness of for 0 e 0 is equivalent to 
/“ iy/ff iu))~^du<oo, where y/e, in our situation, is given by 

OO 

(exp(-M0d - 1 + udi). 
i=l 

This conjecture, however, is open to date. Our proof technique demonstrates a method of 
proving such annealed results via approximation by random graphs. Thus, classification 
of those c G Zo for which the spaces Mf (A) are compact will lead to a broad class of prior 
distributions on 0 for which is compact. 


Problem 8.1. Find necessary and sufficient conditions on c that ensure compactness of the 
spaces M^(A) for i^l. 


Another related problem is to find the fractal dimensions of the limiting spaces. As a corol¬ 
lary to Theorem|1.9[ we get 


dim j = (t - 2) / (t - 3) a. s. 


( 8 . 1 ) 


whe re 0'“’ is as in | |2.10| l corresponding to c of the form ( |1.191 l . Proposition|7.l|and Proposition 


7.2 


show that the assertion in ( |8.11 l remains true if we replace by 0. Now, it is not hard to 
prove that 


inf Ojj 


I/(t-2) 


>0 a.s. and sup 6 ^’<oo a.s. 


It then follows that 

T-2 = sup]a3:0 : lim = cxal = inf] a ^ 0 : Urn u~‘‘y/Q{u) = 0> a.s., 

V ht ^OO } V Vt ^OO } 

which in turn implies that both the Hausdorff dimension and the packing dimension of a y/g 
Levy tree equal (t - 2)/(t - 3) a.s. (see Using the analogy between ICRTs and Levy 


trees as in 111 


IS the heuristic behind Conjecture [L^ 


.y^(l). This: 

Problem 8.2. Prove Conjecture \l.3\ 


Section 7], it is natural to expect that the same is true for and hence for 
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Space of tenable parameters giving rise to ICRTs.10 

Tilt functional and associated tilted p-tree distribution. fl7 
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